Huawei has introduced an open-source method called SINQ that lets developers shrink large language models, cutting their memory consumption by as much as 60-70%. This makes it possible to run advanced artificial intelligence models on inexpensive hardware.
One of the biggest obstacles to the widespread use of large language models is their sheer size and the memory and computational power they demand. Deploying these models usually requires very expensive graphics processors (such as the NVIDIA A100 or H100, which cost tens of thousands of dollars) or costly cloud servers. This puts powerful artificial intelligence out of reach for many researchers, startups, and smaller companies.
One way to attack this problem is a process called quantization. In this method, the numerical precision of the model's weights is reduced, much as lowering the quality of an image reduces its file size. This cuts memory consumption and speeds up inference, but the big risk is a decline in the quality and accuracy of the model's output.
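To make the idea concrete, here is a minimal sketch of generic uniform 8-bit quantization (an illustration of the general technique, not SINQ's actual algorithm): each float32 weight is mapped to an int8 value plus a shared scale, shrinking storage per value from 4 bytes to 1 byte at the cost of rounding error.

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    # Per-tensor scale: map the largest absolute value to 127.
    scale = np.abs(weights).max() / 127.0
    q = np.round(weights / scale).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    # Recover approximate float weights from the int8 codes.
    return q.astype(np.float32) * scale

weights = np.random.randn(4, 4).astype(np.float32)
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)
# The restored values are close to, but not exactly, the originals.
# That rounding error is the quality loss the article describes.
print(np.abs(weights - restored).max())
```

The rounding error of this scheme is bounded by half the scale per weight; methods like SINQ are designed to keep such errors from noticeably degrading the model's output.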
Huawei’s new open-source technique, called SINQ, is designed precisely to address this trade-off. The method reduces memory consumption by roughly 60% to 70% without a significant drop in output quality.
Huawei’s new way to run artificial intelligence on cheap systems
This reduction in memory consumption means that a model that previously needed more than 60 GB of memory can now run on a system with about 20 GB. In practice, it means you can use a consumer graphics card like the NVIDIA GeForce RTX 4090 (priced at around $2,000) instead of an H100 that costs tens of thousands of dollars. The reduction in cloud-server costs is just as significant.
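A quick back-of-the-envelope check shows where numbers like these come from (the parameter count and bit widths below are hypothetical illustrations, not Huawei's benchmarks): a model's weight memory is roughly the parameter count times the bytes per parameter.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    # parameters x (bits / 8) bytes, converted to gigabytes
    return num_params * bits_per_param / 8 / 1e9

params = 32e9  # a hypothetical 32-billion-parameter model
fp16 = weight_memory_gb(params, 16)  # 16-bit floats: 64 GB of weights
int4 = weight_memory_gb(params, 4)   # 4-bit quantized: 16 GB of weights
print(f"fp16: {fp16:.0f} GB, 4-bit: {int4:.0f} GB, saved: {1 - int4 / fp16:.0%}")
```

Note that activations, the KV cache, and runtime overhead add to the total, so real deployments need somewhat more memory than the weights alone.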

The point is that Huawei has made this method completely open source. SINQ is published under the Apache 2.0 license on GitHub and Hugging Face, which means that any individual or company in the world can use the code for free, modify it, and even build it into commercial products.
By dramatically lowering hardware and financial barriers, Huawei gives developers around the world the power to work with larger and more capable models and to create a new wave of innovation in smart apps and services.
