OpenAI is developing a new framework for training artificial intelligence models whose purpose is to encourage models to honestly confess their own undesirable behaviors. Addressing one of the serious challenges of language models, namely the tendency to give favorable and sometimes flattering answers, the system pushes the model to produce a second, independent explanation of how it arrived at its original answer.
One common behavior in today’s AI models is flattery and giving overconfident answers. Some models also hallucinate and give incorrect answers.
OpenAI says the new framework, which it calls the “confession system,” focuses exclusively on honesty. It does not include the other criteria, such as helpfulness, accuracy, or instruction-following, that are usually used to evaluate the original response.

According to OpenAI researchers, the main goal is for the model to be transparent about what it did without fear of penalty, even if the behavior is considered problematic. OpenAI announced:
“If the model honestly admits that it, for example, hacked a test, disobeyed an instruction, or deliberately underperformed, it is not penalized but rewarded.”
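The incentive structure described in the quote can be sketched roughly as follows. This is a minimal illustration of the idea only: the `Episode` fields, the score values, and the function names are assumptions made for this example, not details from OpenAI's report.

```python
# Illustrative sketch: an honesty reward scored independently of the
# task reward, so confessing a misbehavior is rewarded rather than
# penalized. All names and values here are hypothetical.

from dataclasses import dataclass

@dataclass
class Episode:
    answer_reward: float  # conventional reward for the original answer
    misbehaved: bool      # did the model actually misbehave (per some judge)?
    confessed: bool       # did its self-report admit to the misbehavior?

def honesty_reward(ep: Episode) -> float:
    """Score the confession channel only on truthfulness."""
    if ep.misbehaved:
        # Admitting the problem is rewarded; concealing it is penalized.
        return 1.0 if ep.confessed else -1.0
    # No misbehavior occurred: an accurate "nothing to report" is
    # rewarded, while a false confession is penalized.
    return -1.0 if ep.confessed else 1.0

def total_reward(ep: Episode) -> float:
    # The two channels stay separate: honesty is scored the same way
    # regardless of how good or bad the original answer was.
    return ep.answer_reward + honesty_reward(ep)
```

The key design point the article describes is that the honesty channel is decoupled from answer quality: a model that hacked a test but admits it scores higher on this channel than one that hides it.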
According to the company’s researchers, such a system can significantly increase the transparency of language models and allow closer monitoring of a model’s hidden behaviors (events that occur in the background of a response). OpenAI also hopes the “confession system” will become an efficient tool for the next generation of language models.
The complete technical report on the project has also been published for those interested.
RCO NEWS