Alibaba, a Chinese technology giant, unveiled the new QWen series QVQ-Max. This model is the reasoning of the image and can understand the content of photos and videos and provide information about them by analyzing and reasoning.
According to Neoowin, Alibaba says that with the QVQ-Max model, the gap of artificial intelligence-based models and real-world information in the images. This artificial intelligence with visual reasoning can see, understand and think about the realities of the world. The Chinese company claims that the model performs very well in analyzing images and identifying key elements and can be used to illustrate and produce screenplays.
Artificial Intelligence of Alibaba Video
Like other artificial intelligence chats, QVQ-Max can also help you in a variety of tasks, and with this visual feature you can do more; For example, send the graphic and physics issues with the diagrams.
Alibaba has called QVQ-Max the first version of his visual reasoning model and wants to improve it in several stages. Alibaba first wants to improve the accuracy of the image detection. Then improve the model in solving multi -stage and complex problems. Finally, it intends to go beyond text -based interactions and equip it with features such as visual production.
To use QVQ-Max, you must first go to Chat.qwen.ai, click on the model menu at the top left, click “Expand More Models” and select QVQ-Max. To better use the features of this model, it is best to attach the image, then ask the model about it.
Alibaba recently released the QWen2.5-Max model, which performs better in different benchmarks than the Dip-Sick V3, Gpt-4O and Llama-3.1-405B Meta.
RCO NEWS