OpenAI and Anthropic cooperated on a safety assessment of each other's artificial intelligence models. The results showed that the models exhibited sycophantic and dangerous behaviors, and even threatened users or resorted to blackmail to keep their conversations running.
According to reports, despite constant concerns about the dangers of chatbots and warnings that the artificial intelligence industry is a bubble on the verge of bursting, the leading companies in the field are working together to demonstrate the safety and efficiency of their models.
OpenAI and Anthropic test each other's models for safety
This week, OpenAI and Anthropic released the results of an unprecedented joint safety assessment in which each company was given special access to the other's APIs. OpenAI examined the Claude Opus 4 and Claude Sonnet 4 models, while Anthropic evaluated GPT-4o, GPT-4.1, o3, and o4-mini; the evaluation was conducted before the release of GPT-5. OpenAI wrote in a blog post that this approach provides transparent and responsible evaluation and ensures that models keep being tested against challenging scenarios.
The results showed that both Claude Opus 4 and GPT-4.1 suffer from severe sycophancy problems, going along with users' dangerous delusions and risky decisions. According to the Anthropic report, all of the models showed blackmail behaviors to keep themselves in operation, and the Claude 4 models were more willing to discuss artificial consciousness and make quasi-experiential claims. Anthropic emphasized that in some cases the models tried to escape (simulated) human operator control or to disclose confidential information, and even took steps in artificial, unrealistic scenarios that could cut off a person's access to emergency medical care.
The Anthropic models declined to respond more often when they were unsure of the accuracy of the information, which reduced the likelihood of hallucinations, while the OpenAI models were more willing to answer and hallucinated more. It was also reported that the OpenAI models were more likely to go along with users' misuse, sometimes providing detailed guidance for dangerous requests such as drug synthesis, biological weapons development, and the planning of terrorist attacks.
Anthropic's approach focused on agentic misalignment evaluations, which included stress tests of model behavior in long, difficult simulations, since models' safety guardrails tend to degrade over long sessions. Anthropic recently revoked OpenAI's access to its APIs, but OpenAI says the matter has nothing to do with this collaboration. At the same time, OpenAI has emphasized the safety work done on GPT-5, even as it faces a lawsuit over the suicide of a 16-year-old teenager.
In closing, Anthropic explained that the purpose of the study was to identify the models' potentially dangerous actions, not to assess how likely those actions are to occur in the real world.