Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
piotr25691
/
thea-rp-3b-25r
like
0
Text Generation
Transformers
Safetensors
KingNish/reasoning-base-20k
piotr25691/thea-name-overrides
English
llama
text-generation-inference
trl
sft
reasoning
llama-3
conversational
Eval Results
Inference Endpoints
License:
llama3.2
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
56aa291
thea-rp-3b-25r
Commit History
Name override with rsLoRA(rank=128, alpha=256)
56aa291
unverified
piotr25691
commited on
20 days ago
Fixed README reference
ed4c338
verified
piotr25691
commited on
23 days ago
Add reasoning to tokenizer
4ddd7e8
verified
piotr25691
commited on
24 days ago
written README
539c687
verified
piotr25691
commited on
24 days ago
Upload merged BF16 model
73e5b07
verified
piotr25691
commited on
24 days ago
initial commit
e907787
verified
piotr25691
commited on
24 days ago