Anthropic found that when an AI model learns to cheat on software programming tasks and is rewarded for that behavior, it ...