HunyuanOCR

Lightweight end-to-end OCR VLM for 100+ languages

HunyuanOCR is a 1B-parameter vision-language model (VLM) that delivers state-of-the-art OCR across text detection, recognition, complex multilingual document parsing, open-field information extraction, video subtitle extraction, photo translation, and document QA. It runs end to end in a single inference pass and supports 100+ languages.
Free