Discussion about this post

User's avatar
Lavander's avatar

>the AGI design should be widely separated in the design space from any design that would constitute a hyperexistential risk

https://www.lesswrong.com/w/separation-from-hyperexistential-risk

Elias Schmied's avatar

Coming back to this again - it remains really really great.

I wonder if there is a more "viral", more focused version of it that talks only about perverse pessimization, and lists examples from EA and AI safety, like you hinted at in this tweet: https://x.com/RichardMCNgo/status/1858948025336688882

"A second major reason why perverse pessimization arises is that anti-X behavior can help someone rise within a (nominally) pro-X coalition. This is a kind of vice signalling, showing that you have enough power to directly defy the values of your own coalition."

This feels slightly off to me - in my head, it's not to signal power, but to signal that you are someone who "plays the game", that you are not a mark, that you don't have scruples and are therefore easier to cooperate with for others like you. People who care about winning at all costs need to find each other.

3 more comments...

No posts

Ready for more?