News from the AI & ML world
Alexey Shabanov@TestingCatalog
//
Alibaba Cloud has unveiled Qwen 3, a new generation of large language models (LLMs) boasting 235 billion parameters, poised to challenge the dominance of US-based models. This open-weight family of models includes both dense and Mixture-of-Experts (MoE) architectures, offering developers a range of choices to suit their specific application needs and hardware constraints. The flagship model, Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, and general knowledge, positioning it as one of the most powerful publicly available models.
Qwen 3 introduces a unique "thinking mode" that can be toggled for step-by-step reasoning or rapid direct answers. This hybrid reasoning approach, similar to OpenAI's "o" series, allows users to engage a more intensive process for complex queries in fields like science, math, and engineering. The models are trained on a massive dataset of 36 trillion tokens spanning 119 languages, twice the corpus of Qwen 2.5 and enriched with synthetic math and code data. This extensive training equips Qwen 3 with enhanced reasoning, multilingual proficiency, and computational efficiency.
The release of Qwen 3 includes two MoE models and six dense variants, all licensed under Apache-2.0 and downloadable from platforms like Hugging Face, ModelScope, and Kaggle. Deployment guidance points to vLLM and SGLang for servers and to Ollama or llama.cpp for local setups, signaling support for both cloud and edge developers. Community feedback has been positive, with analysts noting that earlier Qwen announcements briefly lifted Alibaba shares, underscoring the strategic weight the company places on open models.
ImgSrc: www.testingcata
References :
- Gradient Flow: Qwen 3: What You Need to Know
- AI News | VentureBeat: Alibaba launches open source Qwen3 model that surpasses OpenAI o1 and DeepSeek R1
- TestingCatalog: Alibaba Cloud debuts 235B-parameter Qwen 3 to challenge US model dominance
- MarkTechPost: Alibaba Qwen Team Just Released Qwen3
- Analytics Vidhya: Qwen3 Models: How to Access, Performance, Features, and Applications
- www.analyticsvidhya.com: Qwen3 Models: How to Access, Performance, Features, and Applications
- THE DECODER: Qwen3 series from Alibaba debuts with benchmark results matching top competitors
- www.tomsguide.com: Alibaba is launching its own AI reasoning models to compete with DeepSeek
- the-decoder.com: Qwen3 series from Alibaba debuts with benchmark results matching top competitors
- pub.towardsai.net: TAI #150: Qwen3 Impresses as a Robust Open-Source Contender
- Pandaily: The Mind Behind Qwen3: An Inclusive Interview with Alibaba's Zhou Jingren
- Towards AI: TAI #150: Qwen3 Impresses as a Robust Open-Source Contender
- gradientflow.com: Table of Contents Model Architecture and Capabilities What is Qwen 3 and what models are available in the lineup? What are the “Hybrid Thinking Modes� in Qwen 3, and why are they valuable for developers? How does Qwen 3 compare to previous versions and other leading models? What are the advantages of Qwen 3’s Mixture-of-Experts ...
- bdtechtalks.com: Alibaba's Qwen3 open-weight LLMs combine direct response and chain-of-thought reasoning in a single architecture, and compete withe leading models. The post first appeared on .
- bdtechtalks.com: Alibaba's Qwen3 open-weight LLMs combine direct response and chain-of-thought reasoning in a single architecture, and compete withe leading models. The post first appeared on .
- RunPod Blog: Qwen3 Released: How Does It Stack Up?
- www.computerworld.com: The Qwen3 models, which feature a new hybrid reasoning approach, underscore Alibaba's commitment to open-source AI development.
- Last Week in AI: OpenAI undoes its glaze-heavy ChatGPT update, Alibaba unveils Qwen 3, a family of ‘hybrid’ AI reasoning models , Baidu ERNIE X1 and 4.5 Turbo boast high performance at low cost
Classification:
- HashTags: #LLM #opensource #Qwen3
- Company: Alibaba Cloud
- Target: US AI Models
- Product: Qwen 3
- Feature: thinking mode
- Type: AI
- Severity: Informative