home All News open_in_new Full Article

Alibaba launches AI model that can process images and video on phones and laptops

The multimodal Qwen2.5-Omni-7B model is designed to run locally on mobile devices and tops rivals in some benchmarks.



Alibaba launched Qwen2.5-Omni-7B, a multimodal AI model with 7 billion parameters, capable of processing text, images, audio, and video on mobile devices. The open-source model is available on Hugging Face, GitHub, and ModelScope, and outperforms previous Alibaba models in audio and image benchmarks.

today 3 d. ago attach_file Culture

attach_file Politics
attach_file Politics
attach_file Politics
attach_file Politics
attach_file Politics
attach_file Politics
attach_file Politics
attach_file Culture
attach_file Politics
attach_file Politics
attach_file Economics
attach_file Politics
attach_file Events
attach_file Politics
attach_file Economics
attach_file Events
attach_file Culture
attach_file Politics
attach_file Politics
attach_file Politics


ID: 1326737450
Add Watch Country

arrow_drop_down