Regularization Term Antisuperposition

Mar 09, 2026 · 1 min read

ai_safety
machine_learning

A paper by Anthropic on removing polysemanticity in toy models by applying a regularization term to the hidden-layer activations (i.e., adding $\lambda \lVert h \rVert_{1}$ to the loss function).
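
As a rough illustration of what adding an L1 penalty on hidden activations to a loss looks like, here is a minimal sketch in plain Python. The toy architecture ($h = Wx$, reconstruction $\hat{x} = \mathrm{ReLU}(W^{\top} h)$), the squared-error reconstruction term, and all names (`regularized_loss`, `lam`, and the helpers) are assumptions made for this example, not details taken from the paper.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def matvec(W: Matrix, x: Vector) -> Vector:
    """Multiply matrix W (rows x cols) by vector x (length cols)."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in W]


def transpose(W: Matrix) -> Matrix:
    """Return the transpose of W."""
    return [list(col) for col in zip(*W)]


def relu(v: Vector) -> Vector:
    """Element-wise max(0, v_i)."""
    return [max(0.0, v_i) for v_i in v]


def l1_norm(h: Vector) -> float:
    """Sum of absolute values, i.e. ||h||_1."""
    return sum(abs(h_i) for h_i in h)


def regularized_loss(W: Matrix, x: Vector, lam: float) -> float:
    """Squared reconstruction error plus lam * ||h||_1.

    Assumed (hypothetical) toy model: h = W x, x_hat = ReLU(W^T h).
    """
    h = matvec(W, x)                       # hidden activations
    x_hat = relu(matvec(transpose(W), h))  # reconstruction of the input
    reconstruction = sum((a - b) ** 2 for a, b in zip(x, x_hat))
    return reconstruction + lam * l1_norm(h)


if __name__ == "__main__":
    # Two hidden units, three input features.
    W = [[1.0, 0.0, 0.0],
         [0.0, 1.0, 0.0]]
    x = [1.0, -2.0, 3.0]
    print(regularized_loss(W, x, lam=0.5))
```

With `lam = 0` the function reduces to the plain reconstruction loss; increasing `lam` makes every nonzero hidden activation more expensive, which is the pressure the regularization term is meant to apply.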

