Qwen3
Start new thread
trending
Zac Zuo

8d ago

Qwen-Image-2512 - SOTA open-source T2I model with even greater realism

Qwen-Image-2512 is the new open-source SOTA for text-to-image generation. It delivers drastically improved photorealism, finer natural details, and superior text rendering.
Zac Zuo

9d ago

What a year for Qwen!

Check out the epic Super Qwenrio adventure in 2025. Really looking forward to 2026!

Zac Zuo

18d ago

Qwen-Image-Layered - Turn flat images into multi-layer editable assets

Qwen-Image-Layered decomposes images into transparent RGBA layers, unlocking inherent editability. You can move, resize, or delete objects without artifacts. Supports recursive decomposition and variable layer counts.
Zac Zuo

4mo ago

Qwen3-Omni - Native end-to-end multilingual omni-modal LLM

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
Zac Zuo

4mo ago

Qwen3-VL - Sharper vision, deeper thought, broader action

Qwen3-VL is the new flagship vision-language model from the Qwen team, excelling at visual agent tasks, long-video understanding, and spatial reasoning with a native 256K context window.
Zac Zuo

4mo ago

Qwen3-Next - The future of efficient LLMs

Qwen3-Next is a new family of models from the Qwen team, featuring a novel architecture that activates just 3B of its 80B parameters. This delivers performance comparable to much larger models with a >10x speedup, especially on long-context tasks.
Zac Zuo

4mo ago

Qwen3-ASR - High-accuracy ASR with flexible contextual biasing

Qwen3-ASR is a new high-accuracy speech recognition model. It supports 11 languages, excels at transcribing songs with background music, and features a unique contextual biasing system that accepts any text format to improve accuracy on specific terms.
Zac Zuo

5mo ago

Qwen-Image-Edit - The open model for semantic image editing

Qwen-Image-Edit is the editing version of the 20B Qwen-Image model. It offers precise, model-native editing, including bilingual text modification and both high-level semantic and low-level appearance changes.
Zac Zuo

6mo ago

Qwen3-Coder - A powerful open model for agentic coding tasks

Qwen3-Coder is a new 480B MoE open model (35B active) by the Qwen team, built for agentic coding. It achieves SOTA results on benchmarks like SWE-bench, supports up to 1M context, and comes with an open-source CLI tool, Qwen Code.
Zac Zuo

5mo ago

Qwen-Image - Stunning images and perfect text

Qwen-Image is a new 20B open-source image foundation model by the Qwen team. It excels at complex text rendering (especially Chinese) and precise image editing, while also delivering strong general image generation. Available now in Qwen Chat.
12
Next
Last