Salesforce has released a new suite of AI-powered language models trained on massive amounts of textual data, up to 1.5 trillion tokens. Known as the XGen-7B family, these models are designed to handle unstructured data (data that does not fit neatly into rows and columns, such as text and images) and are markedly better at analyzing and organizing it than Meta's LLaMA models.

As more people start using AI tools like ChatGPT, the data fed into these systems will become more complex and less structured. This makes tools like ChatGPT, which are designed to analyze language and text, harder to apply when the input does not follow a clear structure. There is therefore a growing need for advanced systems that can handle unstructured data in order to meet the rising demand for artificial intelligence tools.

Businesses can take advantage of chat systems like ChatGPT or Bard to summarize long documents or analyze customer data for insights. For these systems to be effective, however, they need to be trained on huge amounts of data. Many businesses opt for smaller, cheaper models, which are not always capable of complex tasks such as summarizing long documents or scrutinizing customer data. As a result, these businesses cannot take full advantage of the technology.

Open-source language models such as Meta's LLaMA, Falcon-7B, and MPT-7B are not ideal for managing long texts or documents, because they can only handle a maximum sequence length of about 2,000 tokens (text units). The XGen-7B family of language models developed by Salesforce, by contrast, uses standard dense attention over sequences of up to 8,000 tokens and was trained on up to 1.5 trillion tokens of data. This makes these models an effective tool for managing and analyzing long documents.

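To see why sequence length is the bottleneck, note that standard dense attention scores every token against every other token, so its memory cost grows quadratically with context size. The sketch below illustrates that arithmetic; the head count and fp16 precision are illustrative assumptions, not Salesforce's published configuration.

```python
# Illustrative sketch: dense attention builds an n x n score matrix per
# head, so memory grows quadratically with sequence length n.
# num_heads and fp16 (2 bytes/value) are assumptions for illustration.

def attention_matrix_bytes(seq_len: int, num_heads: int = 32,
                           bytes_per_value: int = 2) -> int:
    """Bytes needed for one layer's attention score matrices
    (one n x n matrix per head)."""
    return num_heads * seq_len * seq_len * bytes_per_value

# A ~2,000-token context (LLaMA-class limit) vs. an 8,000-token context:
short = attention_matrix_bytes(2_000)
long = attention_matrix_bytes(8_000)
print(f"2K context: {short / 2**20:.0f} MiB per layer")   # ~244 MiB
print(f"8K context: {long / 2**20:.0f} MiB per layer")    # ~3906 MiB
print(f"growth: {long // short}x")                        # 4x tokens -> 16x
```

The 16x jump for a 4x longer context is why most open models cap their sequence length, and why training efficiently at 8K tokens is notable.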
Salesforce researchers trained a set of language models with seven billion parameters using Salesforce's JaxFormer library together with publicly available training data. The resulting models achieved better results than open-source models such as LLaMA, Falcon, and RedPajama. The researchers also report that training a model on 1 trillion tokens cost only about $150,000 on Google Cloud's TPU-v4 platform, a comparatively cost-effective and efficient way to train large language models. Researchers have thus been able to create an advanced AI model that can analyze and process large amounts of data more accurately than other open-source alternatives, while keeping the cost of training relatively low.
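The cost figure above implies a simple per-token rate. The back-of-the-envelope arithmetic below derives it; the per-billion-token rate and the linear extrapolation to the full 1.5-trillion-token corpus are estimates computed from the article's numbers, not reported figures.

```python
# Back-of-the-envelope arithmetic from the figures in the text:
# roughly $150,000 to train on 1 trillion tokens using TPU-v4.
# The extrapolation assumes cost scales linearly with token count.

TOTAL_COST_USD = 150_000
TOKENS_TRAINED = 1_000_000_000_000  # 1 trillion

cost_per_billion = TOTAL_COST_USD / (TOKENS_TRAINED / 1_000_000_000)
print(f"~${cost_per_billion:.0f} per billion tokens")      # ~$150

# Linear extrapolation to the 1.5T-token corpus mentioned above:
full_run = cost_per_billion * 1_500
print(f"~${full_run:,.0f} for 1.5 trillion tokens")        # ~$225,000
```

At roughly $150 per billion tokens, even the full 1.5T-token run stays in the low hundreds of thousands of dollars, which is the basis for the cost-effectiveness claim.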



