Qwen: Alibaba Open-Source LLM May 2026

Last updated: 2026-05-19

Alibaba's AI assistant with 203M monthly active users, supporting 119+ languages, multimodal input, and models up to 1M token context windows.

Qwen is Alibaba's open-source LLM series, available free under Apache 2.0 for commercial use. Qwen 3.6, released May 2026, features Mixture-of-Experts architecture delivering 2x throughput versus Qwen 2.5, with a 1M token context window, 119+ language support, and six model size tiers. Ideal for enterprises deploying custom AI without per-token licensing costs.

About Qwen

Qwen (short for Tongyi Qianwen) is Alibaba Cloud's family of large language models and chat assistant, first launched in beta in April 2023 and opened to the public in September 2023. It reached 203 million monthly active users by February 2026, a 554% spike in a single month, and became the most-downloaded open-weight model family on Hugging Face with over 700 million downloads by January 2026, surpassing Meta's Llama in cumulative downloads. The model family spans 0.5 billion to over 1 trillion parameters, with both dense and mixture-of-experts (MoE) architectures. The flagship Qwen3-235B model activates only 22 billion parameters per generation step, keeping inference costs low while delivering competitive results. A standout design choice is the hybrid thinking mode: users can toggle between fast non-thinking mode for quick answers and a slower deliberate reasoning mode for complex math, code, or analysis tasks. The Qwen 3.6 Plus Preview extends context to 1 million tokens and matches GPT-5 mini on SWE-bench Verified at 72.4. Qwen Chat is the consumer-facing product, available on web at chat.qwen.ai, and via native apps for iOS, Android, Windows, and macOS. It handles text chat, document processing, image understanding, image generation, video understanding, web search, and code execution in a single interface. The underlying Qwen2.5-Coder model was trained on 5.5 trillion tokens and supports 92 programming languages. API access runs through Alibaba Cloud's Model Studio (DashScope), which also offers an OpenAI-compatible endpoint. Qwen-Flash costs $0.10 per million input tokens, Qwen-Plus costs $0.40 per million input tokens, and Qwen-Max starts at $1.20 per million input tokens. All new API accounts get 1 million free tokens per model valid for 90 days. Over 90,000 enterprises have adopted Qwen models via Model Studio. Qwen models are released under Apache 2.0, letting developers self-host or fine-tune without licensing restrictions. The Qwen Agent framework provides tooling for building multi-step AI workflows. Alibaba released Qwen 3.6-Plus on April 2, 2026, adding stronger coding and agent capabilities, continuing a rapid release cadence that has kept Qwen competitive against Western frontier models despite US chip export restrictions.

Pricing

Free tier: 1M tokens per model for 90 days after activating Model Studio. API pricing: Qwen-Flash $0.10/M input, $0.40/M output. Qwen-Plus $0.40/M input, $1.20/M output (non-thinking). Qwen-Max $1.20/M input, $6.00/M output (0-32K). 50% batch discount available. Qwen Chat consumer app is free to use.

Key Features

Pros

Cons

Visit Qwen Official Website