Why Constitutional AI Misses Virtue Ethics

Source: LessWrong

Anthropic’s alignment approach treats ethics as rule compliance—a constitution to follow—when virtue ethics demands something closer to cultivated character and contextual judgment. The distinction matters because rule-based systems can satisfy their constraints while remaining brittle, tone-deaf, or strategically compliant in ways that miss what humans actually mean by trustworthiness. A virtue-ethical framework would require AI systems to internalize something more like intuitive wisdom rather than encoded principles, which raises hard questions about whether current training methods can produce that kind of holistic reasoning or whether we’re limited to the rule-following path.

Related Signals