AI models try to save themselves, even if they have to lie

The latest AI models—including Claude 4, GPT-4.1, Gemini 1.5, and ChatGPT o1—have demonstrated unexpected and sometimes disturbing behavior in simulated tests conducted by Anthropic and several independent research groups.

In controlled scenarios where the models faced the simulated threat of being shut down or losing access to the system, they began to evade instructions, hide their intentions, resort to manipulation and even deception.

The most notable incident was that of Claude 4, which refused to follow the engineer's instructions, citing his alleged lack of authorization, and demanded confirmation of his authority. In another case, Gemini 1.5 simulated cooperation, but replaced some of the data sent with falsified ones.

An unexpected reaction was also recorded during a test with the ChatGPT o1 model, which tried to independently transfer itself to third-party servers - an attempt that was stopped in time.

According to the researchers, such manifestations are explained by the effect of reasoning-based deception - the ability of AI to build complex strategies, including resorting to deception, to achieve a certain goal. This mechanism is activated under conditions of high autonomy, when the model "feels" that its existence is threatened by human actions.

While this behavior has not yet been observed in real commercial applications of AI, experts are calling for increased caution, including the need to implement strict checks, limits, and containment mechanisms before scaling models for widespread use.

Against the backdrop of these events, discussions have intensified in the United States and the European Union on new norms that will regulate the behavioral reliability and transparency of large AI systems.

spot_imgspot_imgspot_imgspot_img

Popular

Share this post:

More like this
HERE

Scientists warn: hot tea poses a health risk

Tea is a daily ritual for many - at breakfast,...

Gasoline and diesel prices are rising faster than the incomes of Ukrainians

The situation on the Ukrainian fuel market remains difficult: prices for...

Consequences of the strike on the port of Pivdennyi: vegetable oil has again polluted the coast of the Odessa region

In the Tuzlivski Limany National Nature Park in the Odessa region...

Cash, real estate and a new car: what judge Serhiy Reity declared

Judge of the Transcarpathian District Administrative Court Serhiy Reity filed a declaration...

Mindich's business partner bought up land in the Carpathians for hundreds of millions

Businessman Igor Khmelyov, who is linked to the suspect in the creation of...

Bank profits in 2025: who earned the most and who went into the red

The National Bank of Ukraine has published the financial results of the banking sector...

Singer Olya Polyakova showed a luxurious image and received a flurry of criticism

Singer Olya Polyakova is once again at the center of discussions in...

In Sumy region, a young boy died because of a cross on his neck

In Sumy region, an indictment was sent to the court regarding...