Ferret

Ferret

Refer and ground anything anywhere at any granularity

5.0
1 review

207 followers

A new type of multimodal large language model (MLLM) from Apple that excels in both image understanding and language processing, particularly demonstrating significant advantages in understanding spatial references.

No makers yet

It looks like there are no makers for this product.