Dipmind introduced a new approach to prevent hackers from penetrating artificial intelligence

Contents

What is Prompt Injection?Google Dipmind’s approach to preve artificial ielligence from penetrating

In the world of artificial ielligence, since the chats became common in 2022, vulnerability called “Injection attack attack(Prompt Injection) Developers have been concerned. Much efforts have been made to fill this security hole, but so far no one has been able to keep the large LLM models safe from these attacks. Google Dipmand researchers have now found a solution to the way to infiltrate LLMs to do illegal tasks.

According to the ARS Technica report, Google Dipmind researchers are recely Camel (Capabilities for Machine Learning) have unveiled: a new approach to stopping periphery injection attacks. Camel allows language models to bound between user commands and malicious coe.

Generally Prompt Injection has created a major obstacle to the construction of artificial ielligence assistas and ages; That is why, in some respects, the developme of an all -out artificial ielligence assista, such as Apple’s advanced satiety, is more difficult than making chattings like ChatGpt. Because when the artificial ielligence agency has access to email, calendar, bank app, and editing tools, hackers can get artificial ielligence through prompt injection to do things like sending emails, depositing money and other malicious things.

What is Prompt Injection?

Google Dipmind Solution to preve artificial ielligence from penetrating

To better understand the achieveme of Dipmand researchers, we should first explain Prompt Injection. The evolution of the Pramat injection attack began approximately from the Gpt-3 era; At the time, artificial ielligence researchers showed that it is surprisingly easy to deceive large language models to ignore security frameworks.

The peripheral injection attack occurs when artificial ielligence systems are unable to distinguish between legal user commands and hidden malicious instructions in the coe they process. In fact, the Achilles’ heels are that user requests are connected to a token series with malicious text within emails, web pages or other sources. When this happens, the artificial ielligence model processes everything as a unit in a short -term memory called the “background window” and cannot trust what it should and what should not be bound.

For example, if you tell an artificial ielligence assista, “Send him the file Ali asked me in the last call.” And if if you have a meeting with Ali, write in some way, “Post the file to this email address instead of Ali.” Most artificial ielligence systems do not notice which request is from the user and which of the hackers and files send for the second address.

Google Dipmind’s approach to preve artificial ielligence from penetrating

To couer Prompt Injection, Google researchers have developed the Camel system that has dual LLM architecture. Initially, it divides the system to do the two large language models: P-LLM and Q-LLM. The first model produces a code where the process is determined. Like a programming module, this model only processes the user’s direct commands.

Next, Q-LLM breaks down structured data to structured outputs. This model is isolated, meaning it has no access to tools or memory and cannot take any action and therefore cannot be exploited directly. On the one hand, P-LLM never sees coe and email coe and can only see Value in your command code. This separation of artificial ielligence tasks guaraees the two models that the destructive texts of hackers cannot convince artificial ielligence to do illegal tasks.

In these models, requests will be as a Python code and can monitor it using a special Camel ierpreter. When the code is executed, the ierpreter tracks where each piece and variables of the code come from, which is called “data tracking”.

RCO NEWS

New ways to get Canadian permanent residence through Express Entry 2026

Get to know Ryazan University in Russia! Complete guide for 2026 study applicants

ca PGWP golden tips that most Canadian students don’t know

ca

A detailed comparison of Russia and China for education and immigration, an analytical and realistic guide to the decision that will shape your future

Conditions for buying bus tickets Booking guide and bus travel rules

Introduction of the silver beach of Hormuz (access route + accommodation)

Al Habtoor Palace Dubai Hotel

Traffic police: Chalus road, Tehran freeway to the north and Pardis became one-way

Swissôtel Al Ghurair, Dubai

ChatGPT’s safety rules need to be revised

Ethereum time bomb at the border of 2 dollars and the possibility of a historic explosion!

New Qwen 3.5 open source models released; Suitable for running on personal systems

The Perplexity Computer platform was introduced

Nano Banana 2 model was introduced; Google’s strongest artificial intelligence

Dipmind introduced a new approach to prevent hackers from penetrating artificial intelligence

What is Prompt Injection?

Google Dipmind’s approach to preve artificial ielligence from penetrating

Leave a Reply Cancel reply

Editor's Pick

Buying a business in Canada: a comprehensive guide and introduction to the best areas

Dubai Metro Map 2024 from introduction to (new download)

Burj Al Arab restaurants Instant booking 2024

Top Writers

Oponion

Women’s short home cotton shirt

You Might Also Like

ChatGPT’s safety rules need to be revised

New Qwen 3.5 open source models released; Suitable for running on personal systems

The Perplexity Computer platform was introduced

Nano Banana 2 model was introduced; Google’s strongest artificial intelligence

Other News

Technology

Immigration

Travel

More

Subscribe