HunyuanOCR

Lightweight end-to-end OCR VLM for 100+ languages

2 followers

Lightweight end-to-end OCR VLM for 100+ languages

2 followers

HunyuanOCR is a 1B-parameter multimodal VLM delivering SOTA OCR across detection, recognition, complex multilingual document parsing, open-field info extraction, video subtitle extraction, photo translation and document QA. End-to-end single-inference, 100+ languages.

Overview
Reviews
Team
More

Free

Launch tags:Open Source•Developer Tools•Artificial Intelligence

Launch Team

Framer AI AgentsDesign and publish professional sites with AI

Promoted

Mom Clock

Hunter

📌

I tested the HunyuanOCR demo and it handled noisy video frames impressively, it can be great for devs building transcription, localization, or archive tools.

Report

8mo ago

Reviews