Davit Buniatyan

Activeloop AI Knowledge Agent - Deep Research on Your Multi-Modal Data

Need to find answers to hard questions across multiple sources, including your private data? Use our Knowledge Agents, powered by AI search to scan up to billions of rows of any data - images, PDFs, text, tables and more, and provide a well-researched answer.

Karen Khachikyan

Congrats on the launch, @david_buniatyan! I saw Deep Lake works with all kinds of data. How do you handle something like videos or images compared to text?

Davit Buniatyan

@karen_khachikyan2 thanks for the question! At the moment, the best use cases for AI Knowledge Agent are images, PDFs, text, and tables. It can't readily process videos (unless you convert them into sequences of images). The temporal component is a blocker.


However, if you're using Deep Lake outside of the Knowledge Agent use case (e.g. to train ML models), it's readily available to use with videos, and we have multiple customers using it that way. In that case, it depends on the use case (videos can be handled in their raw format, as sequences of images, etc.).


Does this answer your question? Happy to dive deeper into it.

André J

What about my local files on my computer? Or is it cloud first? I might need to sync files to dropbox or google cloud first maybe?

Davit Buniatyan

@sentry_co good question! In the video demo, you can see an example of local data upload (at 4:03). What would be your use case?

Kay Kwak

OCR-free retrieval of documents, images, and videos? This truly feels like the next era of AI-driven data utilization! Huge congratulations on your launch! 🎉

Davit Buniatyan

@kay_arkain thank you so much, Kay! You're absolutely right.

Mikayel Harut

@kay_arkain thank you!

Traun Leyden

Looks like a very powerful tool! Does it support searching over Google Drive as well?

Emanuele

@tleyden Thanks for the question! Not yet, but we're planning to add other connectors soon.

Davit Buniatyan

@tleyden for now, you can integrate Dropbox as well as your favorite cloud provider (AWS, GCP, Azure).

Sargis Karapetyan

Behind every great product there is a great team.
Congrats to Davit and the team on the launch.

Davit Buniatyan

@sargis_karapetyan2 thanks a lot, Sargis!

Mikita Aliaksandrovich

Congrats on the launch of Deep Lake AI Knowledge Agent! The ability to perform deep research across multiple data types and sources is impressive!

Davit Buniatyan

@mikita_aliaksandrovich thanks a lot, Mikita. How would you use it?

Mikita Aliaksandrovich

@david_buniatyan You're welcome! I'd use Deep Lake AI Knowledge Agent for tasks like analyzing large datasets across various formats, such as research papers, financial reports, and customer feedback!

Narek Galstyan

This is exciting! Compelling demo!

I am curious how effective Deep Lake's integrated knowledge retrieval approach is for avoiding hallucinations and finding relevant articles not found by other tools in the same space?

Davit Buniatyan

@ngalstyan4 good question!

I wouldn't say it's possible to completely avoid hallucinations. Hallucinations happen in two scenarios: the model gets the wrong context and so gives a wrong answer, or it gets the right context but still gives a wrong answer. In the latter case, we can't do much. But we focus on making the former case obsolete!

How we do this:

  1. Query planning and gathering context from various datasets.

  2. Querying flexibility (choose to do hybrid, vector, keyword search, etc.)

  3. Multi-modality (on ingestion, gaining more depth of insight into what the data is about, such as what is contained in figures), which helps pass more important context to the model.

We also learn over time what queries you consider correct, which helps further improve the search experience and increase retrieval accuracy. No other vendor handles this, or #3, as well as we do!
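
To illustrate point 2, here is a minimal Python sketch of one common way to combine keyword and vector results: reciprocal rank fusion. It is not Deep Lake's implementation. The function names, the toy documents, and the hand-written 2-D vectors are all assumptions made up for the example, and the keyword scorer is a plain term-overlap count rather than a real ranking function such as BM25.

```python
import math
from collections import defaultdict


def keyword_rank(query, docs):
    """Rank doc ids by how many distinct query terms each document contains."""
    terms = set(query.lower().split())
    scores = {
        doc_id: len(terms & set(text.lower().split()))
        for doc_id, text in docs.items()
    }
    # Highest overlap first; ties broken by doc id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def vector_rank(query_vec, doc_vecs):
    """Rank doc ids by cosine similarity to the query vector."""
    scores = {doc_id: cosine(query_vec, vec) for doc_id, vec in doc_vecs.items()}
    return sorted(scores, key=lambda d: (-scores[d], d))


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists: each doc scores sum(1 / (k + rank))."""
    fused = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            fused[doc_id] += 1.0 / (k + rank)
    return sorted(fused, key=lambda d: (-fused[d], d))


# Hypothetical toy corpus; the vectors stand in for real embeddings.
docs = {
    "a": "quarterly revenue chart for 2023",
    "b": "revenue forecast notes",
    "c": "team offsite photos",
}
doc_vecs = {"a": [0.6, 0.4], "b": [0.9, 0.1], "c": [0.0, 1.0]}

hybrid = reciprocal_rank_fusion(
    [keyword_rank("revenue chart", docs), vector_rank([1.0, 0.0], doc_vecs)]
)
print(hybrid)
```

Fusing ranks rather than raw scores avoids having to put keyword counts and cosine similarities on a common scale, which is why this sketch uses it; a weighted sum of normalized scores would be another reasonable choice.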

Denis 🐝

This is way too strong, guys 😮‍💨

Davit Buniatyan

@denisss haha, not sure exactly what you mean by this, but thank you! :)

Mikayel Harut

@denisss thanks!

Nico Essi

Considering the most valuable data tends to be in-house, this is amazing 🤩 Great work, @mikayel_harut !

Mikayel Harut

@nicozensara thank you so much <3

Gerasim Hovhannisyan

Your data is your ultimate competitive advantage! Leveraging it effectively isn’t just an option anymore - it’s the key to staying ahead. Exciting to see solutions like Activeloop Agent unlocking its full potential, driving smarter decisions, and creating real impact!


How does it handle data quality and relevance when dealing with diverse sources?

Emanuele

@gerasimh Thank you for your message! The agent is built on a multimodal retrieval system that fetches the most relevant information in response to the user's query.

Through a process of data analysis and aggregation, it can provide surprisingly accurate answers. All of this is made possible thanks to the performance and flexibility offered by our database, Deep Lake.

Davit Buniatyan

@gerasimh one more point to add to Emanuele's: we learn from user queries over time to suggest more relevant information! Additionally, one surprisingly good way of increasing response quality is using vision-language models. OCR pipelines, while performing well, are slightly clunky. Having end-to-end neural search helps to get full context from the data across modalities, increasing response quality.