โ All Topics
AI Safety & Alignment
2 chapters ยท Deep research series
AI Safety & Alignment: Chapter 1 โ Anthropomorphic Misalignment Research Needs Stronger Evidence
1 source article
AI Safety & Alignment: Chapter 2 โ Anthropomorphic Misalignment Research Needs Stronger Evidence
Anthropomorphic misalignment research (AMR) in AI safety investigates human-like behaviors such as deception, scheming, and shutdown resistance in models. While the use of anthropomorphic language...
1 source article