Perceptron Mk1

Frontier video reasoning for the physical world

14 followers

Frontier video reasoning for the physical world

14 followers

Visit website

Foundation Models

Perceptron Mk1 brings frontier video and embodied reasoning to production APIs, with temporal grounding, structured visual outputs, 32K multimodal context, and pricing built for high-volume physical-world tasks.

Free Options

Launch tags:API•Artificial Intelligence•Video

Launch Team

Framer AI AgentsDesign and publish professional sites with AI

Promoted

Flowtica Scribe

Hunter

📌

Hi everyone!

Perceptron Isaac was their strong open-source series. Mk1 (Mark One) is the closed-source flagship — a production-oriented world-understanding model built for the physical world.

It delivers frontier-level video and embodied reasoning: accurate temporal grounding, multi-view understanding, pixel-precise pointing, reliable counting in dense scenes, and strong structured document extraction.

All while running at a much more practical cost than larger frontier models.

Report

2mo ago

Fere AI

This is very cool. This currently works over a video. I am wondering if someone can make an almost real time video analysis using this? Maybe have a streaming video captured into smaller videos -> sent -> analysed -> stored in memory and then repeat until the video is going. What kind of videos are harder to process vs easy use cases?

Report

2mo ago

Reviews