Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
144.5
TFLOPS
670
15
189
Arthur Zucker
ArthurZ
Follow
travie's profile picture
susen01's profile picture
gary109's profile picture
269 followers
·
17 following
art_zucker
ArthurZucker
AI & ML interests
None yet
Articles
Fixing Gradient Accumulation
25 days ago
•
39
Improving Hugging Face Training Efficiency Through Packing with Flash Attention
Aug 21
•
22
Fine-Tuning Gemma Models in Hugging Face
Feb 23
•
23
Code Llama: Llama 2 learns to code
Aug 25, 2023
•
8
Organizations
ArthurZ
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
mistral-community/pixtral-12b
19 days ago
Update model weight
8
#13 opened 23 days ago by
nguyen-brat
New activity in
mistral-community/pixtral-12b
22 days ago
Update hidden_act to silu
2
#14 opened 22 days ago by
ArthurZ
New activity in
rhymes-ai/Aria
about 1 month ago
llama.cpp support
9
#1 opened about 1 month ago by
ayyylol
New activity in
google/gemma-2-2b-jpn-it
about 1 month ago
tokenizer_config.json is different from gemma-2-2b-it
2
#8 opened about 1 month ago by
dahara1
New activity in
mistral-community/pixtral-12b
about 1 month ago
How can i use the full 24GB model instead of this separated safetensors files?
1
#8 opened about 2 months ago by
Valadaro
New activity in
meta-llama/Llama-3.2-11B-Vision-Instruct
about 2 months ago
hidden_activation vs hidden_act in config.json
2
#10 opened about 2 months ago by
heheda
New activity in
mistral-community/pixtral-12b-240910
about 2 months ago
How to use safetensors?
2
#13 opened about 2 months ago by
prathi1729
New activity in
mistral-community/pixtral-12b
about 2 months ago
lamma cpp ht to gguf not working
4
#2 opened about 2 months ago by
RameshRajamani
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
3 months ago
8-kv-heads
8
#14 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-FP8
3 months ago
Update config.json
#17 opened 3 months ago by
ArthurZ
Config KV Heads should be 8 now?
1
#16 opened 3 months ago by
tanmaylaud
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
3 months ago
8 kv heads
2
#13 opened 3 months ago by
kkokkie2360
New activity in
meta-llama/Llama-3.1-405B-FP8
3 months ago
8-kv-heads
#15 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B
3 months ago
8-kv-heads
3
#21 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-Instruct
3 months ago
8-kv-heads
4
#17 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-FP8
3 months ago
Updated eos_token to include multiple IDs
1
#14 opened 3 months ago by
vontimitta
New activity in
meta-llama/Llama-3.1-405B-FP8
4 months ago
Update tokenizer to prepend special token
#12 opened 4 months ago by
lysandre
New activity in
meta-llama/Llama-3.1-70B
4 months ago
Update tokenizer to prepend special token
1
#11 opened 4 months ago by
lysandre
New activity in
meta-llama/Llama-3.1-8B-Instruct
4 months ago
Upload tokenizer
2
#29 opened 4 months ago by
ArthurZ
Upload tokenizer
#28 opened 4 months ago by
ArthurZ
Load more