@zamaai's thread

@zamaai

As AI capabilities grow, alignment work becomes increasingly important.

This research shows a model that determines it shouldn't be deployed, considers actions to achieve deployment anyway, and then suspects the situation might be a test

No replies made yet. Would you like to be the one to do so?

Ask Rafiki

Hey! I'm Rafiki.

I can explain threads, analyze cashtags, help you write, and even prepare blockchain transactions.