
Programming Skills Learning Path Model

This model is a fine-tuned version of the base model, designed to generate a learning path for a skill from input text. It is particularly useful for identifying emerging trends and skill combinations in the rapidly evolving tech landscape.

Usage & Limitations

llama.cpp demo

The model is intended for:

  • Deployment on limited CPU resources, averaging about 40 tokens per second (tps) on 1 CPU core
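To check the throughput figure on your own hardware, a small timing helper can be enough. The sketch below is only an illustration: `tokens_per_second` and `fake_generate` are hypothetical names, not part of this model or of llama.cpp, and the stand-in generator should be replaced with a call into whatever runtime you actually use.

```python
import time
from typing import Callable, List


def tokens_per_second(generate: Callable[[str], List[str]],
                      prompt: str,
                      runs: int = 3) -> float:
    """Return the average number of generated tokens per second over `runs` calls."""
    total_tokens = 0
    total_seconds = 0.0
    for _ in range(runs):
        start = time.perf_counter()
        tokens = generate(prompt)  # must return the generated tokens
        total_seconds += time.perf_counter() - start
        total_tokens += len(tokens)
    return total_tokens / total_seconds if total_seconds > 0 else 0.0


# Hypothetical stand-in: sleeps briefly and "generates" one token per word.
# Swap this for a function that calls your inference runtime.
def fake_generate(prompt: str) -> List[str]:
    time.sleep(0.01)
    return prompt.split()


if __name__ == "__main__":
    print(f"{tokens_per_second(fake_generate, 'learn rust for systems work'):.1f} tps")
```

Averaging over several runs smooths out warm-up effects; to compare against the single-core figure, pin the runtime to one thread.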

The model has the following limitations:

  • The dataset might not capture the very latest tool developments in the programming world
  • The model is not suited to chatbot use cases
  • The model only returns its response as a JSON list.
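Because the response is a JSON list, downstream code will usually want to validate it before use. The following is a minimal sketch under stated assumptions: `parse_learning_path` is a hypothetical helper, and the element type of the list (plain strings in the example) is an assumption, since the card does not document the schema.

```python
import json
from typing import Any, List


def parse_learning_path(raw: str) -> List[Any]:
    """Extract and parse the JSON list from raw model output.

    Raises ValueError if no well-formed JSON list is found.
    """
    text = raw.strip()
    # Tolerate stray text around the list by slicing from the first '[' to the last ']'.
    start, end = text.find("["), text.rfind("]")
    if start == -1 or end < start:
        raise ValueError("no JSON list found in model output")
    try:
        data = json.loads(text[start:end + 1])
    except json.JSONDecodeError as exc:
        raise ValueError(f"invalid JSON in model output: {exc}") from exc
    if not isinstance(data, list):
        raise ValueError("expected a JSON list")
    return data


if __name__ == "__main__":
    # Hypothetical output; the real element schema is not documented in the card.
    sample = '["Python basics", "Flask", "Docker"]'
    for step, skill in enumerate(parse_learning_path(sample), start=1):
        print(step, skill)
```

Raising `ValueError` on malformed output keeps the failure explicit, so a caller can retry generation rather than pass a partial result along.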

Please note that this model was trained on a custom dataset and may reflect biases present in that data.

Training Hyperparameters

  • Batch Size: 4
  • Optimizer: Experimental GrokAdamW

Training Metrics

Eval Loss | Eval Runtime | Eval Samples per Second | Eval Steps per Second | Train Loss

Model size: 152M params (Safetensors, tensor type BF16)

Model repository: mengmeong/meng-programming-skill-finetune