Alpie Core

A 4-bit reasoning model with frontier-level performance

Alpie Core is a 32B reasoning model trained, fine-tuned, and served entirely at 4-bit precision. Built with a reasoning-first design, it delivers strong performance in multi-step reasoning and coding while using a fraction of the compute of full-precision models. Alpie Core is open source and OpenAI-compatible, supports long context, and is available via Hugging Face, Ollama, and a hosted API for real-world use.

Chirag Arya

Hey builders

Modern AI keeps getting better, but only if you can afford massive GPUs and memory. We didn’t think that was sustainable or accessible for most builders, so we took a different path.

Alpie Core is a 32B reasoning model trained, fine-tuned, and served entirely at 4-bit precision. It delivers strong multi-step reasoning, coding, and analytical performance while dramatically reducing memory footprint and inference cost, without relying on brute-force scaling.

It supports 65K context, is open source (Apache 2.0), OpenAI-compatible, and runs efficiently on practical, lower-end GPUs. You can use it today via Hugging Face, Ollama, our hosted API, or the 169Pi Playground.
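
If you want a feel for the integration, here’s a minimal sketch of what an OpenAI-compatible call looks like with the official Python client. The base URL and model id below are placeholders, not confirmed values; check our docs for the real ones.

```python
# Minimal sketch of calling an OpenAI-compatible endpoint.
# base_url and model are hypothetical placeholders -- substitute
# the actual values from the docs.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.169pi.example/v1",  # placeholder endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="alpie-core",  # placeholder model id
    messages=[
        {"role": "user", "content": "Walk through 17 * 24 step by step."},
    ],
)
print(response.choices[0].message.content)
```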

To keep you building over Christmas and the New Year, we’re offering 5 million free tokens on your first API usage, so you can test, benchmark, and ship without friction.

This launch brings the model, benchmarks, API access, and infrastructure together in one place, and we’d love feedback from builders, researchers, and infra teams. Questions, critiques, and comparisons are all welcome as we shape v2.

Sujal Meghwal

@chirag_a2207 This is a solid direction: 4-bit end-to-end with 65K context is not easy to get right.

I run a security and adversarial testing practice focused on LLM, API, and inference-time risks (prompt injection, jailbreaks, context poisoning, OpenAI-compatibility gaps, abuse vectors).

If you’re open to it, I'd be happy to do a free adversarial assessment of Alpie Core and share a short report with findings + mitigations.

No pitch, just stress-testing before v2.

Chirag Arya

@sujal_meghwal Really appreciate this, and thanks for the kind words. You’re absolutely right: getting 4-bit end-to-end with long-context stability is non-trivial.

We’d be open to an adversarial assessment, especially ahead of v2. Stress-testing around prompt injection, jailbreaks, and inference-time risks is something we take seriously. Happy to connect and see how we can collaborate and learn from the findings.

Thanks for offering, will reach out to coordinate soon.

Sujal Meghwal

@chirag_a2207 Great. When you’re ready, I can share a short scope outlining what we’d test (prompt injection, jailbreak surfaces, long-context abuse, OpenAI-compat edge cases, inference-time abuse) and the format of the report, so expectations are clear upfront. Happy to adapt it to whatever stage or constraints you’re working with. Looking forward to working with you and your team.

Malek Moumtaz

@chirag_a2207 A 32B model at 4-bit with strong reasoning is impressive. How do you think about the trade-off between aggressive quantization and reasoning reliability, especially on long, multi-step chains or edge cases where small precision errors can compound?

Chirag Arya

@malekmoumtaz That’s a great question, and it’s exactly the trade-off we spent the most time on.

We don’t treat 4-bit as a post-training compression step. Alpie Core is trained, fine-tuned, and served entirely at 4-bit, so the model learns to reason within low-precision constraints instead of being forced into them later. That significantly reduces error accumulation compared to aggressive quantization applied after the fact.
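
For anyone curious what the conventional path looks like, here’s a rough sketch of load-time 4-bit (NF4) quantization with Hugging Face transformers and bitsandbytes. This is the generic post-training route described above, shown for contrast, not a description of our pipeline, and the repo id is illustrative only.

```python
# Generic load-time 4-bit quantization (NF4) with transformers +
# bitsandbytes -- the "compress after training" route, shown for contrast.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",               # 4-bit NormalFloat weights
    bnb_4bit_compute_dtype=torch.bfloat16,   # matmuls still run in bf16
    bnb_4bit_use_double_quant=True,          # also quantize the quant constants
)

# Illustrative repo id, not an actual package reference.
tokenizer = AutoTokenizer.from_pretrained("169Pi/alpie-core")
model = AutoModelForCausalLM.from_pretrained(
    "169Pi/alpie-core",
    quantization_config=bnb_config,
    device_map="auto",
)
```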

That said, long multi-step chains are still where issues surface first. We’ve found that stability depends less on individual arithmetic precision and more on how the model represents intermediate reasoning states. In practice, this means we actively test for instruction drift, compounding errors, and state collapse across long contexts, and we design training and evaluation around those failure modes.

We’re very cautious about claiming “no trade-offs”; the goal is to make the trade-offs explicit and measurable, and to improve with each iteration, especially for long-horizon and edge-case reasoning. We’ll be happy to hear your feedback once you try it out.

Malek Moumtaz

@chirag_a2207 Chirag! Thanks for the clarification! Will let you know my feedback soon.

Koder Kashif

At first it seemed like it could run on any laptop. Hope you keep optimizing so it runs on most laptops.
Chirag Arya

@koderkashif Good question! Right now, it does need GPU VRAM or a fairly high-end CPU to run locally at this scale. That said, we’re actively optimising it further so it can run on more everyday laptops and eventually even phones over time.
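
For a rough sense of the numbers, here’s a back-of-envelope VRAM estimate for a 32B model with 4-bit weights. The layer and head counts below are illustrative assumptions, not our exact architecture.

```python
# Back-of-envelope VRAM estimate: 4-bit weights plus an fp16 KV cache.
# Ignores activations and framework overhead; architecture numbers are
# illustrative, not Alpie Core's actual config.
params = 32e9
weights_gb = params * 0.5 / 1e9          # 4 bits = 0.5 bytes/param -> ~16 GB

ctx, layers, kv_heads, head_dim = 65_536, 64, 8, 128
# K and V caches: ctx * layers * 2 (K+V) * heads * dim * 2 bytes (fp16)
kv_gb = ctx * layers * 2 * kv_heads * head_dim * 2 / 1e9  # ~17 GB at full 65K

print(f"weights ~{weights_gb:.0f} GB, KV cache at 65K ~{kv_gb:.0f} GB")
```

So even with 4-bit weights, a full 65K-context session wants serious VRAM, which is why typical laptop hardware isn’t quite there yet.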

Thanks for your interest. We’re working on this and will share updates as we make progress.

Koder Kashif
@chirag_a2207 Appreciate it.
Peter Shu

What is the strength of this compared to using the API from OpenRouter? Do you think this is better suited for putting into a product, or for development?

Chirag Arya

@peterz_shu Good question, Peter. OpenRouter is great for quick experimentation and model comparison. Alpie Core is designed to be a consistent, production-ready model that you can rely on for development and products, with predictable behaviour, lower latency, and improved cost control.

We’ll be available on OpenRouter soon for easy evaluation. For now, we’re offering a free first API key on our website so teams can test it properly and share feedback.

Mykyta Semenov 🇺🇦🇳🇱

Congratulations on the launch! We are actually developing several AI startups based on ChatGPT. We’ll check out your product with our team; it might be applicable to our tasks.

Chirag Arya

@mykyta_semenov_ Thank you, appreciate that. Do check it out with your team; we’d be happy to jump on a call and share more context if helpful. It’s a state-of-the-art reasoning model at this scale, trained and served entirely at 4-bit, so it can be a good fit for real product workflows.

Looking forward to hearing your thoughts.

Samuel

Congrats on the launch 👏 Alpie Core is interesting, especially as an alternative to larger reasoning models like Llama or Qwen that rely on heavier hardware and higher precision. Running a 32B model at 4-bit with strong reasoning sounds promising, but I’m curious how it holds up on tougher multi-step reasoning and coding benchmarks where higher-precision models usually shine. Are early users seeing better value in cost efficiency, or in being able to run it on lower-end GPUs? What’s been the most surprising comparison result so far when benchmarking against other open-source reasoning models?