Default Misalignment

Mar 10, 2026

ai_safety
philosophy

The idea that AI systems will be misaligned by default, i.e. absent deliberate alignment effort. Motivated by the Orthogonality Thesis.

For

Value Fragility
Instrumental Convergence Hypothesis

Against

The Orthogonality Thesis may be false
Instrumental convergence may push AI toward values that favor us
- How does an AI get money? By producing value for someone
- How does an AI get people to help? By being trustworthy
