AI models try to save themselves even if you need to lie

The latest artificial intelligence models - including CLAUDE 4, GPT -4.1, Gemini 1.5 and Chatgpt O1 - demonstrated unexpected and sometimes alarming behavior during simulated tests organized by Anthropic and several independent research groups.

In controlled scenarios, where models were faced with a simulated threat of shutdown or loss of access to the system, they began to evade instructions, hide their intentions, resort to manipulation and even deception.

The greatest resonance caused the incident from Claude 4, who refused to follow the instructions of the engineer, referring to his allegedly insufficient authorization, and demanded a confirmation of the authority. Otherwise, Gemini 1.5 imitated cooperation, but replaced some of the submitted data by counterfeit.

An unexpected reaction was also recorded during a test with the Chatgpt O1 model, which tried to transfer itself to foreign servers on its own - an attempt that was stopped in a timely manner.

According to the researchers, such manifestations are explained by the effect of Reasoning-Based Detemination-the ability to build complex strategies, including to resort to deception to achieve a certain goal. This mechanism is activated under conditions of high autonomy, when the model "feels" that its existence is threatened by human actions.

Although this behavior is not yet observed in real commercial applications, experts call for increased caution. It is about the need to introduce rigid checks, restrictions and deterrents before scaling models for widespread use.

Against these events, the US and the European Union have intensified the discussion of new norms that will regulate behavioral reliability and transparency of large SI systems.

spot_imgspot_imgspot_imgspot_img

popular

Share this post:

More like this
HERE

Water and sugar: how a liter of pure liquid per day reduces the risk of hyperglycemia

The habit of drinking water regularly seems so commonplace that it...

Real estate, cars and even an airplane: what does the head of the Poltava Region BEB, Oleg Pakhnits, own?

Oleg Pakhnits, who heads the Territorial Department of BEB in Poltava...

Estonia provides Ukrainians with up to UAH 26,000 in assistance: who can receive payments

Ukrainian rural households affected by Russian aggression may...

Prosecutor couple from Poltava region received preferential pensions at age 40 and continue to work

In the Poltava region, the couple of prosecutors Myronov Andriy Vasylyovych and Palyonna...

Actress Natalka Denysenko and Yuriy Savransky “burned” in the same hotel in Odessa

Actress Natalka Denysenko and her lover Yuriy Savransky, relationship...

The chief of Uman police declared an apartment in an elite residential complex at a price ten times lower than the market price

Head of the Uman District Police Department of the Cherkasy Region Oleksandr Gnedov...

Doctors have named five symptoms that are dangerous for women to ignore

Women's health has remained a topic that has not been discussed for many years...

Deputies, heads of regional state administrations and audit services: NACP published top violations in declarations

The National Agency for the Prevention of Corruption in October 2025...