Prathmesh Bhatt

Just an engineer

used Vy by Vercept

•1 review

What's great

Vy is a phenomenal product. Its vision-first approach, as opposed to the DOM-based approach that other CUA agents use, is truly game-changing. It helps the model understand the real content of the text or image rather than just extracting raw text.

What needs improvement

Vy's speed of thinking and clicking definitely needs improvement. In addition, the model sometimes makes assumptions, which can frustrate the user.

vs Alternatives

Sonnet 4.6

I believe Vy takes a vision approach similar to ChatGPT Atlas. However, the model feels very lightweight and accurate. Moreover, Vy has access to all the tools a desktop user has and a general understanding of how everything works. The model is also proprietary rather than a wrapper, which helps it reject sophisticated prompt injection attacks that some other products, like Browser Use, may fail to catch. The Claude web browser extension is pretty similar, but it takes a DOM approach, which falls short in the long run. Vision generalizes, while DOM approaches are specialized to certain tasks.

Ratings

Ease of use

Reliability

Value for money

Customization

82 views · 4mo ago