Emergent Misalignment” in LLMs: a taste of the forbidden sets LLM on the road to damnation

https://www.schneier.com/blog/archives/2025/02/emergent-misalignment-in-llms.html

“The emergent properties of LLMs are so, so weird”

The emergents are why predicting near-term AI outcomes is so hard. Outcomes range from barely nudging GDP to destroying the world economy.