NVIDIA unveiled Hymba-1.5B-Base, a small language model that combines transformer attention mechanisms with state space models (SSMs). This hybrid architecture is designed to increase efficiency in natural language processing tasks.
Pavel Molchanov, Scientist and Director of Research at NVIDIA, announced the new development on the X platform. “Sharing our team’s new work on Hymba, a compact and efficient language model with a hybrid architecture,” he wrote.
He also published a technical report on the research, explaining the differences between Mamba and attention models and how the two can be combined. He additionally mentioned phenomena such as attention sink and forced-to-attend behavior.
The model uses a dual structure: attention heads handle precise information recall, while SSM heads summarize the broader context efficiently.
Hymba also adds learnable tokens at the beginning of inputs to store important information and reduce the need for additional processing. Finally, to improve memory efficiency and computation speed, Hymba employs techniques such as sharing cached data between layers and a restricted form of attention in which the model focuses only on certain parts of the data and ignores the rest.
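The dual structure described above can be illustrated with a toy sketch. Everything here is an illustrative assumption rather than Hymba's actual implementation: the dimensions, the fixed decay rate of the SSM-style scan, the random (rather than learned) meta tokens, and the normalize-then-average fusion rule are all simplifications chosen to show the idea of running an attention branch and a summarizing state-space branch in parallel over the same input.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_branch(x, d):
    # Toy single-head self-attention: precise token-to-token recall.
    Wq, Wk, Wv = (rng.standard_normal((x.shape[-1], d)) * 0.1 for _ in range(3))
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    return softmax(q @ k.T / np.sqrt(d)) @ v

def ssm_branch(x, d):
    # Toy state-space scan: a decaying running summary of the sequence.
    Wv = rng.standard_normal((x.shape[-1], d)) * 0.1
    v = x @ Wv
    state = np.zeros(d)
    out = np.empty_like(v)
    decay = 0.9  # illustrative fixed decay; a real SSM learns its dynamics
    for t in range(v.shape[0]):
        state = decay * state + (1 - decay) * v[t]
        out[t] = state
    return out

def hybrid_block(x, n_meta=4, d=8):
    # Prepend "meta" tokens (random here, learnable in the real model)
    # that both branches can read from.
    meta = rng.standard_normal((n_meta, x.shape[-1])) * 0.1
    xm = np.concatenate([meta, x], axis=0)
    a = attention_branch(xm, d)
    s = ssm_branch(xm, d)
    # Normalize each branch, then average so both views contribute equally.
    a = a / (np.linalg.norm(a, axis=-1, keepdims=True) + 1e-6)
    s = s / (np.linalg.norm(s, axis=-1, keepdims=True) + 1e-6)
    fused = 0.5 * (a + s)
    return fused[n_meta:]  # drop the meta-token positions from the output

seq = rng.standard_normal((6, 16))  # 6 tokens, embedding dim 16
out = hybrid_block(seq)
print(out.shape)
```

Running the sketch prints an output of shape `(6, 8)`: one fused vector per input token, carrying both the attention branch's recall and the SSM branch's summary.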
An article entitled “Hymba: A Hybrid-head Architecture for Small Language Models” fully explains the design, performance, and applications of the model.
Hymba outperforms Llama-3.2
In a controlled study comparing different architectures under the same conditions, Hymba-1.5B-Base showed significant advantages, surpassing all public models with fewer than 2 billion parameters.
Compared to Llama-3.2-3B, Hymba achieved 1.32% higher accuracy, an 11.67-times smaller cache (temporary memory) footprint, and 3.49-times faster processing.
“Hymba outperforms other small language models such as Meta Llama 3.2 or SmolLM v2, while being trained on only 1.5 trillion tokens,” said Philipp Schmid, Technical Lead at Hugging Face.
Pavel Molchanov added: “I don’t know if we should be proud of training with 1.5 trillion tokens or not, because our goal is to move quickly, and probably in the next two weeks someone will have a better model.”
NVIDIA also provides an environment startup script that facilitates setup and supports CUDA versions 12.1 and 12.4.
A word of caution
NVIDIA disclosed that the model was trained on internet data, which may contain offensive content, unsafe material, and social biases. The Hymba model may therefore reflect these problems: it can give offensive answers to offensive prompts, and may even generate wrong or irrelevant text in response to neutral questions.
Users should set the batch size to one when generating, as the current setup does not fully support padding. Any batch size, however, can be used to train the model.
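The batch-size-one recommendation above simply means running generation on one prompt at a time instead of padding several prompts into one batch. The sketch below assumes the standard Hugging Face `transformers` API (a tokenizer callable that returns tensors, and a model with a `generate()` method); the helper function itself and its name are illustrative, not part of NVIDIA's release.

```python
def generate_one_by_one(tokenizer, model, prompts, max_new_tokens=64):
    """Run generate() on one prompt at a time, i.e. with batch size 1.

    Avoids padded batches, which the current Hymba setup does not
    fully support. `tokenizer` and `model` are assumed to follow the
    Hugging Face transformers interface.
    """
    results = []
    for p in prompts:
        inputs = tokenizer(p, return_tensors="pt")  # a single prompt -> batch of 1
        out = model.generate(**inputs, max_new_tokens=max_new_tokens)
        results.append(tokenizer.decode(out[0], skip_special_tokens=True))
    return results
```

Looping like this trades some throughput for correctness: each prompt is processed at its natural length, so no padding tokens ever enter the model.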
The company emphasizes that building trustworthy artificial intelligence is a shared responsibility, and it has established ethical guidelines for the development of this technology. Users are asked to use the model responsibly and to be aware of its limitations.