Adept

Adept

Useful General Intelligence

107 followers

Adept transforms text in actions. It aims to change the way we use computers, changing the interactions from GUI to NL.
This is the 2nd launch from Adept. View more
Fuyu-8B

Fuyu-8B

A multimodal architecture for AI agents
Fuyu-8B is a multimodal model capable of...
🖼️ Visual Question Answering
🖼️ Image Captioning
🖼️ Text localization and more!
Fuyu-8B gallery image
Free
Launch Team
AppSignal
AppSignal
Built for dev teams, not Fortune 500s.
Promoted

What do you think? …

Chris Messina
Very cool new open source LLM with these capabilities: - Understanding diagrams, charts, and graphs - Doing OCR on screens - Outputting bounding boxes for the locations of objects on screens - Answering UI-based questions
Kenichi Nakahara
Interesting! Is there any technical papers to describe this model and dataset?
Julien Ergan
Very impressive, congrats to the Adept team and open-source contributors. @naoto_shibata_morph @keita_mitsuhashi_morph charts understanding capabilities might be of interest.
Congratulations Team Fuyu-8B on your successful launch on Producthunt. Your multimodal model is very impressive! For enhancement, how about considering a feature that offers insights about the emotional context of the image, making image captioning more interactive and empathetic? Good luck moving forward!
Mathew Simpson
Congrats on the launch!
Tornike Tsiramua
Congrats on the launch! well designed and sophisticated landing page.
Rami
Looking good! Might use in my next app!
12
Next
Last