--- library_name: transformers tags: - ipex - intel - gaudi - PEFT license: apache-2.0 datasets: - timdettmers/openassistant-guanaco --- # Model Card for Model ID This model is a fine-tuned version (my fine-tuned version 2) of [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) on [timdettmers/openassistant-guanaco dataset](https://huggingface.co/datasets/timdettmers/openassistant-guanaco). ## Model Details ### Model Description This is a fine-tuned version (my ft version 2) of the [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) model using Parameter Efficient Fine Tuning (PEFT) with Low Rank Adaptation (LoRA) on the Intel Gaudi 2 AI accelerator. This model can be used for various text generation tasks including chatbots, content creation, and other NLP applications. - **Developed by:** Migara Amarasinghe - **Model type:** LLM - **Language(s) (NLP):** English - **Finetuned from model [optional]:** [meta-llama/Llama-2-7b-hf](https://huggingface.co/meta-llama/Llama-2-7b-hf) ## Uses ### Direct Use This model can be used for text generation tasks such as: - Chatbots - Automated content creation - Text completion and augmentation ### Out-of-Scope Use - Use in real-time applications where latency is critical - Use in highly sensitive domains without thorough evaluation and testing ### Recommendations Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations. ## Training Details ### Training Hyperparameters - Training regime: Mixed precision training using bf16 - Number of epochs: 5 - Learning rate: 5e-5 - Batch size: 16 - Seq length: 512 ## Technical Specifications ### Compute Infrastructure #### Hardware - Intel Gaudi 2 AI Accelerator - Intel(R) Xeon(R) Platinum 8368 CPU #### Software - Transformers library - Optimum Habana library ## Environmental Impact Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700). - **Hardware Type:** Intel Gaudi 2 AI Accelerator - **Hours used:** < 1 hour