NVIDIA's 550B Mixture-of-Experts model with hybrid Mamba-Attention architecture, delivering 300+ tokens/sec with a 1M-token context window. Top-ranked US open-weights model on the Artificial Analysis Intelligence Index. Built specifically for multi-step agent loops where frontier reasoning at open-source economics actually matters. Available now on Hugging Face, OpenRouter, ModelScope, and build.nvidia.com as a NIM microservice.
A 550B MoE frontier-intelligence open model built for long-running agents.
It delivers 5x faster inference and lowers the cost of complex agentic tasks by up to 30% versus other open frontier models.Ultra excels at complex tasks like coding and deep research.
Long-running agents spend their time planning, using tools, recovering from failures, and deciding what to do next.
We introduce PersonaPlex, a full-duplex conversational AI model that enables natural conversations with customizable voices and roles. PersonaPlex handles interruptions and backchannels while maintaining any chosen persona, outperforming existing systems on conversational dynamics and task adherence.
NVIDIA DLSS 5 introduces a real-time neural rendering model that infuses game pixels with photoreal lighting and materials. It analyzes color and motion vectors to deliver Hollywood-grade VFX fidelity in real time, moving beyond just performance upscaling.
NVIDIA NemoClaw is an open source stack that simplifies running OpenClaw always-on assistants safely. It installs the NVIDIA OpenShell runtime, part of NVIDIA Agent Toolkit, a secure environment for running autonomous agents, with inference routed through NVIDIA cloud.
Nemotron 3 Super is NVIDIA"s open 120B model with 12B active parameters, a 1M-token context window, and a hybrid Mamba-Transformer MoE design. It is built for coding, long-context reasoning, and multi-agent workloads without the usual thinking tax.
Chat With RTX is a demo app that lets you personalize a GPT large language model (LLM) connected to your own content—docs, notes, videos, or other data.
Using text and images as descriptions, developers and visual content creators can use NVIDIA Edify 3D to quickly generate 3D objects to create virtual worlds and prototype ideas.
NVIDIA Jetson Nano enables the development of millions of new small, low-power AI systems. It opens new worlds of embedded IoT applications, including entry-level Network Video Recorders, home robots, and intelligent gateways with full analytics capabilities.