meta-llama
/

Llama-3.2-3B-Instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Resources

View closed (12)

Request: DOI

#35 opened 3 days ago by

Chat template is not consistent with documentation?

#34 opened 6 days ago by

Suggestion

#33 opened 7 days ago by

Request: DOI

#32 opened 7 days ago by

Request: DOI

#31 opened 10 days ago by

Request: DOI

#30 opened 12 days ago by

GPTQ 4Bit Llama 3.2-3B-Instruct with 100% Accuracy recovery

#29 opened 13 days ago by

Request: DOI

#28 opened 15 days ago by

Request: DOI

#27 opened 16 days ago by

Request: DOI

#26 opened 17 days ago by

Request: DOI

#24 opened 28 days ago by

Token indices sequence length is longer than the specified maximum sequence length for this model (269923 > 131072)

#23 opened 30 days ago by

what is the chat template?

#22 opened about 1 month ago by

Request: DOI

#21 opened about 1 month ago by

1B and 3B are nice. Please make also an 8B so we can compare it to gemini flash 8B.

#20 opened about 1 month ago by

Issues w/ downloading the model: llama download: error: Model meta-llama/Llama-3.2-3B-Instruct not found

#19 opened about 1 month ago by

Unable to Load Model

#18 opened about 1 month ago by

Extra "assistnat\n\n" at the beginning of the output

#17 opened about 1 month ago by

Adding Evaluation Results

#16 opened about 1 month ago by

roger036

#15 opened about 2 months ago by

Giving contextual messages to sagemaker instance in python

#14 opened about 2 months ago by

MMLU-Pro benchmark

#13 opened about 2 months ago by

Cannot download the model with huggingface-cli

#11 opened about 2 months ago by

Thanks. This is astonishingly good for its size.

#9 opened about 2 months ago by