Discussion about this post

User's avatar
Davey's avatar

I like the critiques of consequentalism and deontological principles, but "virtue" still feels like a stand in for the flexible nuanced elusive thing we want AIs aligned to. Like conceptual negative space.

sepiatone's avatar

Would Anthropic's move from the earlier Constitutional AI to the current "soul" document approach be considered moving from a purely deontological (obey rules) approach to a proto-virtue approach (express certain traits like honesty, kindness, helpfulness, and possess a stable disposition)?

No posts

Ready for more?