Cosmopedia
Updated Aug 18
Resources for Cosmopedia dataset
HuggingFaceTB/cosmopedia
Viewer • Updated Aug 12 • 31.1M • 10.8k • 561
HuggingFaceTB/cosmo-1b
Text Generation • Updated Jul 8 • 775 • 127
Web clusters
Running • 5
HuggingFaceTB/cosmopedia-100k
Viewer • Updated Feb 19 • 100k • 420 • 39
HuggingFaceTB/cosmopedia-meta
Viewer • Updated Feb 20 • 31.1M • 56 • 2
HuggingFaceTB/smollm-corpus
Viewer • Updated Sep 6 • 237M • 26.4k • 239