@zamaai's thread

zamaai

As AI capabilities grow, alignment work becomes increasingly important.

This research shows a model that determines it shouldn't be deployed, considers actions to achieve deployment anyway, and then suspects the situation might be a test.