Multi-Layer Sparse Autoencoders with Transformers
Collection
Single SAEs trained on the residual stream activation vectors from every transformer layer simultaneously (the collection also includes the transformer models themselves).
30 items
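The key idea above, training one SAE on the residual stream of every layer at once rather than one SAE per layer, can be sketched as follows. This is a minimal illustration, not the collection's actual training code: the `SparseAutoencoder` class, the dimensions, and the L1 sparsity penalty are all assumptions for demonstration.

```python
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    """A plain SAE sketch: ReLU encoder, linear decoder (assumed architecture)."""
    def __init__(self, d_model: int, n_latents: int):
        super().__init__()
        self.encoder = nn.Linear(d_model, n_latents)
        self.decoder = nn.Linear(n_latents, d_model)

    def forward(self, x):
        z = torch.relu(self.encoder(x))  # sparse latent activations
        return self.decoder(z), z        # reconstruction and latents

torch.manual_seed(0)
n_layers, batch, d_model = 12, 32, 768  # illustrative sizes, not the real config
# Residual-stream activation vectors from every layer, stacked on a new axis.
acts = torch.randn(n_layers, batch, d_model)
# A *single* SAE sees all layers simultaneously: fold the layer axis into the batch.
flat = acts.reshape(-1, d_model)
sae = SparseAutoencoder(d_model, n_latents=4 * d_model)
recon, z = sae(flat)
# Reconstruction loss plus an (assumed) L1 sparsity penalty on the latents.
loss = (recon - flat).pow(2).mean() + 1e-3 * z.abs().mean()
```

Because the layer axis is folded into the batch, the same dictionary of latent directions is shared across all layers, which is what distinguishes this setup from training a separate SAE per layer.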
This model has been pushed to the Hub using the PyTorchModelHubMixin integration.