
MiniCPM
Ultra-efficient on-device AI, now even faster
380 followers
Ultra-efficient on-device AI, now even faster
380 followers
MiniCPM is a family of ultra-efficient, open-source models for on-device AI. Offers significant speed-ups on edge chips, strong performance, and includes highly quantized BitCPM versions.
This is the 8th launch from MiniCPM. View more
MiniCPM5-1B
Launching today
MiniCPM5-1B is a dense 1B open model built for on-device and local deployment. It supports 131K context, Think / No Think modes, tool calling, GGUF and MLX formats, major inference backends, and even powers an offline desktop pet.






Free
Launch Team



Flowtica Scribe
Hi everyone!
MiniCPM5-1B is currently the strongest open-source model under 2B for on-device use:
It hits SOTA in the 1B-class on agentic tool use, code generation, and tough reasoning tasks while keeping a very small footprint.
The INT4 weights are only around 0.5GB, which makes the local story much more real.
OpenBMB also shipped a cute Desktop Pet fully powered by this model — completely local, no cloud:
SOTA at 1B parameters running fully on device is wild. the cost of not needing cloud inference adds up fast when you're running agents all day. 131K context on edge hardware is the part I'd want to stress test