Llama 3 self-align experiments
Replicating the StarCoder2-Instruct self-alignment pipeline on Llama-3-8B, with some tweaks: https://huggingface.co/blog/sc2-instruct
Note: Docker container used to generate both datasets (data-generation-results & functions-filtered).
* Pull it down with `docker pull registry.hf.space/muellerzr-llama-3-self-align-data-gen-docker:latest`
* Run it via `docker run --gpus all -it registry.hf.space/muellerzr-llama-3-self-align-data-gen-docker:latest`
* From there, follow the steps laid out at https://github.com/muellerzr/llama-3-8b-self-align/tree/main?tab=readme-ov-file#data-generation-pipeline (including `seed_gathering`)
muellerzr/llama-3-8b-self-align-data-generation-results
Note: Contains all steps detailed in the data-generation pipeline after `seed_gathering`. Each branch pertains to one of the steps, using Llama-3-8B to do the data generation via vLLM:
- Snippet -> Concept (`snippet-to-concept`)
- Concept -> Instruction (`concept-to-instruction`)
- Instruction -> Response (`instruction-to-response`)
- Execution filter (`execution-filter`)
- Sanitization & final selection (`sanitization-and-selection` & the `main` branch)
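Since each stage lives on its own branch of the dataset repo, any one of them can be pulled by passing the branch name as `revision` to 🤗 `datasets`. A minimal sketch (the split name and per-branch schema are assumptions, not guaranteed):

```python
# Sketch: load one stage of the pipeline by pointing `revision` at its branch.
# Branch names are the ones listed above; the "train" split is an assumption.
from datasets import load_dataset

ds = load_dataset(
    "muellerzr/llama-3-8b-self-align-data-generation-results",
    revision="instruction-to-response",  # or any other branch above, or "main"
    split="train",
)
print(ds)     # column layout differs per stage
print(ds[0])  # inspect a single generated example
```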
muellerzr/python-stack-v1-functions-filtered-llama-3-8B
Note: Contains the results of the `seed_gathering` portion of the data-generation pipeline, but using Llama-3-8B as the judge
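For intuition, a minimal sketch of what an LLM-as-judge filtering pass over candidate seed functions can look like with vLLM is below. The actual prompt, filtering criteria, and checkpoint live in the repo's `seed_gathering` scripts; the prompt text and the `Meta-Llama-3-8B-Instruct` model id here are assumptions, not the repo's exact setup.

```python
# Sketch only: score candidate Python functions with Llama-3-8B via vLLM and
# keep the ones the model judges acceptable. Prompt and checkpoint are
# placeholders, not the actual ones used in the repo.
from vllm import LLM, SamplingParams

JUDGE_PROMPT = (
    "You are reviewing a Python function as a potential instruction-tuning seed.\n"
    "Answer with a single word, YES or NO: is this a well-formed, "
    "self-contained function?\n\n{function}\n\nAnswer:"
)

llm = LLM(model="meta-llama/Meta-Llama-3-8B-Instruct")  # assumed checkpoint
params = SamplingParams(temperature=0.0, max_tokens=3)

def judge(functions):
    """Return only the functions the judge answers YES for."""
    prompts = [JUDGE_PROMPT.format(function=fn) for fn in functions]
    outputs = llm.generate(prompts, params)
    return [
        fn
        for fn, out in zip(functions, outputs)
        if out.outputs[0].text.strip().upper().startswith("YES")
    ]
```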
muellerzr/llama-3-8B-self-align
Note: Contains the `config.yaml` for `accelerate launch` and the bash script used to start training