In Tests, OpenAI's New Model Lied And Schemed To Avoid Being Shut Down
It sounds like OpenAI's latest AI is showing signs of a drive for self-preservation. It pursued survival at all costs.
In recent third-party tests of the company's o1 large language model, the AI resisted evaluators' efforts to try to shut it down through an oversight protocol, according to a new report published by red teaming organization Apollo Research.
"When o1 was led to believe that it would be shut down when acting on its goal and that its actions were monitored,…
Keep reading with a 7-day free trial
Subscribe to Neural News Network to keep reading this post and get 7 days of free access to the full post archives.