N0X

Run LLMs, search, and RAG in your browser. No server.

5.0•1 review•

8 followers

Run LLMs, search, and RAG in your browser. No server.

5.0•1 review•

8 followers

Visit website

I kept bouncing between ChatGPT, Gemini, and Grok for stuff that didn't need a massive model and all of them wanted my data or my money. n0x runs in your browser LLM inference on your GPU, document Q&A, web search, Python execution, image gen, memory. Pick a model, start using it. Nothing leaves your machine. Works offline after first visit. Supports Ollama and any OpenAI-compatible API for bigger models.

This is the 2nd launch from N0X. View more

N0X

Launching today

Run any LLM in your browser — offline, private, zero cloud.

Since launch 1: added a ReAct agent loop that runs tools (web search, Python, docs, image gen) autonomously. Hybrid RAG now uses BM25 + vector search fused with RRF + MMR reranking way better recall than vector-only. Auto-routing picks local vs cloud per message complexity. Chrome AI (Gemini Nano, zero download) works alongside WebGPU, Ollama, and any OpenAI-compatible endpoint. Persistent semantic memory across sessions. GPU-tier detection now blocks models that'd OOM your device.

Free

Launch tags:Developer Tools•Artificial Intelligence•GitHub

Launch Team / Built With

Framer AI AgentsDesign and publish professional sites with AI

Promoted

Maker

📌

Built this because I was tired of pasting sensitive docs into ChatGPT and just hoping for the best. Started as a weekend experiment "how hard is it to run Llama in a browser tab?" Turns out, hard. WebGPU is wild. Half my time went into worker thread hell and figuring out why the model would just freeze at 0% forever. The part that surprised me most: hybrid search (BM25 + vector + reranking) on documents actually works really well in WASM. I expected it to be garbage. It's not. This is the 2nd launch because the first version was basically just "load a model, chat with it." Version 2 added an agent that can actually browse, run code, search your docs, and generate images all chained together, no backend. Happy to answer anything. Especially if something's broken on your GPU that's still the hardest part to debug remotely.

Report

16h ago

Previous N0X Launches

N0XRun LLMs, search, and RAG in your browser. No server.

Launched on June 4th, 2026

N0X

Run LLMs, search, and RAG in your browser. No server.

Run LLMs, search, and RAG in your browser. No server.

N0X

Previous N0X Launches

Previous N0X Launches

What's great

What needs improvement

vs Alternatives

What's great

What needs improvement

vs Alternatives