OpenAI has just introduced its advanced artificial intelligence models, o3 and o4-mini, which perform impressively in areas such as coding and solving mathematical problems. But these models face a serious problem: hallucinations, that is, producing inaccurate or fabricated information.
Contrary to expectations, these models hallucinate more than previous OpenAI models such as o1 and GPT-4o. Even OpenAI itself does not know the exact reason, which has raised concerns about the accuracy of these new technologies.
According to OpenAI's reports, the o3 model produces inaccurate information on 33% of questions about people on the PersonQA benchmark, roughly double the rates of the earlier o1 and o3-mini models (16% and 14.8%, respectively).
The o4-mini model fared even worse, hallucinating in 48% of cases. Independent experiments by the Transluce laboratory have also shown that o3 sometimes makes unrealistic claims about its own problem-solving process, such as asserting that it executed code on a device it has no access to. Experts believe that the reinforcement learning methods used to train these models may exacerbate the problem.
These hallucinations could limit the use of the new models in fields such as law or medicine, where accuracy is critical. However, OpenAI is considering solutions such as adding web search capabilities, which can improve accuracy; for example, the GPT-4o model reaches about 90% accuracy on the SimpleQA benchmark with web search enabled. As OpenAI and the broader AI industry move toward advanced reasoning models, solving hallucinations has become a key challenge that requires further research.