Alibaba Pushes Ahead in AI Race with New AI Model
Alibaba has introduced a new addition to its Qwen series AI models, designed to process video, audio, images, and text with remarkable efficiency.
This model is optimised to run directly on laptops and mobile devices, broadening its accessibility and functionality.
Available on GitHub and Hugging Face, it can be integrated into AI agents that provide real-time audio descriptions to assist the visually impaired in navigating their environment.
With a rapid pace of innovation, Alibaba is fully committed to advancing AI technology in 2025, following the recent launch of DeepSeek and an updated version of its AI assistant app, Quart, earlier this year.
The AI Arms Race Intensifies
Alibaba is not alone in the race for multimodal AI innovation.
Rivals like OpenAI and Google, part of Alphabet Inc., are also pushing the boundaries with generative AI tools capable of processing diverse inputs such as text and audio.
Recently, OpenAI enhanced ChatGPT with advanced image generation features, signalling its commitment to expanding its AI capabilities.
The company has also announced plans to significantly increase its investment in AI and cloud computing, outpacing its previous decade of spending.
Alibaba, in turn, is positioning itself as a key partner for companies seeking to leverage AI in practical applications, especially as these models become more complex and require greater computational power.
At the same time, affordable AI services from China are challenging the premium-priced offerings from major US companies, adding pressure to their business models.
However, doubts persist as to whether these emerging AI technologies truly rival or surpass the cutting-edge advancements of Western tech.