Zac Zuo

Qwen3-Next - The future of efficient LLMs

Qwen3-Next is a new family of models from the Qwen team, featuring a novel architecture that activates just 3B of its 80B parameters. This delivers performance comparable to much larger models with a >10x speedup, especially on long-context tasks.

Add a comment

Replies

Best
Zac Zuo
Hi everyone! Looking at the benchmarks, the Qwen3-Next-80B-A3B-Instruct is impressively competitive with the massive Qwen3-235B-A22B flagship model. This is beyond just an iteration, it's a new architectural approach. The model's unique design keeps its 80B total parameters but only activates 3B at a time. This allows it to achieve performance comparable to much larger dense models while delivering a >10x inference speedup, especially on long-context tasks over 32K. I'm excited to see if the Qwen team applies this new architecture to even larger parameter models. I'm always thrilled by breakthroughs in foundational model architecture because they usually lead to leapfrog improvements in capability. To the sky, Qwen!