Discover how MetaSAEs use joint training with decomposability penalties to create more atomic, interpretable sparse autoencoder latents for safer AI models...
Enhance Vision Transformer efficiency with sparse autoencoder latents for dynamic head pruning, boosting accuracy and interpretability in computer vision m...