
Preprocess
Preprocess maximises RAG performances
98 followers
Preprocess maximises RAG performances
98 followers
Chunking heavily impacts the performance of your retrieval when dealing with LLMs. Preprocess split documents into optimal chunks of text. We split PDF and Office files based on the original document structure and content semantics.






Preprocess
Breadcrumbs
Congrats @nicola_abbasciano and Team! Super useful solution nowadays to avoid reinventing the wheel in every ai product!
Preprocess
@winrey I can't wait to hear your feedback after you've tried it!
Shram
The focus on automating document preprocessing for LLMs is indeed a crucial step that can save a lot of time and effort for data scientists and developers. The variety of supported document formats and features like intelligent parsing and chunking seem incredibly practical.
Congrats on the launch! Best wishes and sending lots of wins :) @nicola_abbasciano
Fable Wizard
Preprocess looks really useful! Sorting and preparing documents for AI can be a hassle, so having an automated tool sounds like a big help. How well does it handle messy documents with mixed formats?
Preprocess
ehi @jonurbonas , what do you mean by "mixed formats"? Preprocess can handle very complex layouts of the most used documents formats (PDF, Word, PowerPoint, Excel, etc..)