Model Details
Model Description
Finetune of LLaMa 3.2 1B model to include flashnormalization (https://arxiv.org/abs/2407.09577)
- Developed by: OpenMachine Labs
- License: MIT
- Finetuned from model Meta LLaMa 3.2 1B
Model Sources [optional]
- Repository: https://github.com/meta-llama/llama-models/tree/main/models/llama3_2
- Paper https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/
Uses
How to Get Started with the Model
Use the code below to get started with the model.
Speeds, Sizes, Times
[More Information Needed]
Evaluation
[More Information Needed]
Metrics
[More Information Needed]
Results
[More Information Needed]
Summary
Model Examination [optional]
Model Card Authors
Nils Graef ([email protected])
Drew Wasielewski ([email protected])
- Downloads last month
- 32
Model tree for drewwas/OpenMachine_FlashNorm
Base model
meta-llama/Llama-3.2-1B