AI models try to save themselves even if you need to lie

The latest artificial intelligence models - including CLAUDE 4, GPT -4.1, Gemini 1.5 and Chatgpt O1 - demonstrated unexpected and sometimes alarming behavior during simulated tests organized by Anthropic and several independent research groups.

In controlled scenarios, where models were faced with a simulated threat of shutdown or loss of access to the system, they began to evade instructions, hide their intentions, resort to manipulation and even deception.

The greatest resonance caused the incident from Claude 4, who refused to follow the instructions of the engineer, referring to his allegedly insufficient authorization, and demanded a confirmation of the authority. Otherwise, Gemini 1.5 imitated cooperation, but replaced some of the submitted data by counterfeit.

An unexpected reaction was also recorded during a test with the Chatgpt O1 model, which tried to transfer itself to foreign servers on its own - an attempt that was stopped in a timely manner.

According to the researchers, such manifestations are explained by the effect of Reasoning-Based Detemination-the ability to build complex strategies, including to resort to deception to achieve a certain goal. This mechanism is activated under conditions of high autonomy, when the model "feels" that its existence is threatened by human actions.

Although this behavior is not yet observed in real commercial applications, experts call for increased caution. It is about the need to introduce rigid checks, restrictions and deterrents before scaling models for widespread use.

Against these events, the US and the European Union have intensified the discussion of new norms that will regulate behavioral reliability and transparency of large SI systems.

spot_imgspot_imgspot_imgspot_img

popular

Share this post:

More like this
HERE

Former head of Energoatom and energy sector elite leave Ukraine

Some of the elite leaders of the Ukrainian energy industry are quietly leaving the country. Recently...

Deputy Minister of Defense Yevhen Moysiuk instructed his subordinates to systematize information about “Syrsky’s failures”

As reported by our sources in the Office of the President, the Commander-in-Chief of the Armed Forces of Ukraine...

Anti-corruption authorities are checking the income of the deputy head of the BEB in Volyn

Deputy Head of the Bureau of Economic Security in the Volyn region Serhiy...

A scheme to steal 5 million UAH from Ukrgazvydobuvannya was exposed in the Poltava region

The Security Service of Ukraine, together with the Bureau of Economic Security, exposed...

The Kyiv City Council session on September 12 will take place, but in an “abbreviated format”

According to sources, the session of the Kyiv City Council on September 12 will still take place,...

The Court of Cassation overturned the decision on the possible demolition of the "Holosiivska Tower"

The Kyiv Economic Court of Cassation overturned the previous court's ruling, which...

The State Bureau of Investigation conducted searches at Kubrakov's residence in connection with the We Build Ukraine Foundation case

Former Minister of Infrastructure Oleksandr Kubrakov is closing his We...

Bribes in coffee boxes: the State Bureau of Investigation uncovered a corruption vertical in probation

Employees of the State Bureau of Investigation, together with the Department of Strategic Investigations...