Monkt

Transform files and web pages into AI-ready Markdown or JSON

357 followers

Transform files and web pages into AI-ready Markdown or JSON

357 followers

Visit website

Automation tools

•

LLMs

•

AI Infrastructure Tools

Monkt convert PDFs, Word files, Excel sheets, PowerPoint presentations and web pages into structured Markdown or JSON while preserving semantic structure. Apply custom schemas, process in batches, and use predefined templates through REST API or web interface.

Interactive

Free Options

Launch tags:API•Developer Tools•Artificial Intelligence

Launch Team / Built With

AdaptYour company brain. AI that thinks + acts across your stack.

Promoted

Monkt

Maker

📌

Hey Product Hunt community, I built Monkt to solve a recurring challenge in ML pipelines - converting various document formats into structured data while preserving their semantic structure. After building custom document processing solutions for different projects, I decided to package these patterns into a cloud service. The key features emerged from practical needs: ✔ Convert files/URLs to Markdown or JSON; ✔ Apply custom JSON schemas for validation; ✔ Process documents in batches; ✔ Use predefined templates for common patterns; ✔ Simple REST API integration. Would love your feedback on the approach and use cases you see for structured document processing in ML workflows. Have a great year!

Report

1yr ago

Monkt

Maker

Hey everyone, I appreciate your support; the feedback I've received so far has been amazing. You made my day on January 1st! :D I’ve added some extra capacity since we experienced some spikes in traffic over the last hour. Everything is running smoothly now. I can offer a lifetime discount for anyone who has experienced technical difficulties! Just send me a screenshot through my contact channels. Thank you once again!

Report

1yr ago

Monkt

Maker

@julia_zakharova2 Hi Julia, Thanks a lot for the nice words, appreciated!

Report

1yr ago

Pocket Hansei

Looks great. I am more interested in seeing how exactly are you doing it. There are many open source repos that does this including but not limited to from big companies like Facebook, Microsoft. Do you have a Hackernews discussion for this? or technical discussion somewhere?

Report

1yr ago

Monkt

Maker

@krazygaurav93 Hey Gaurav, Thank you for the nice words. I have published this article with more information: https://medium.com/@simeon.emanu... Will record a few videos in the next two days. I think the API can bring a lot of value, so I will first make a video overview of this part.

Report

1yr ago

Pocket Hansei

@simeon @simeon_emanuilov I see you showed "MarkItDown". It's good but as you correctly pointed out it's not scalable. Also using LLM to parse document is just non sense, it becomes expensive, slow and non scalable. However you have not disclosed your solution there, I understand why. In case you want to brainstorm on the solution for further improvement let me know. Well this is a very common scientific problem you are trying to solve. I personally like "OpenParse" but it has limitations. I then started to use "Marker" which is really good but is commercial and is open for research only. I think OCR is the right way to go, I am thinking of training my own model on huge PDF formats to solve problems like headings, two-side pdf, multi-page tables etc. Best of luck.

Report

1yr ago

Good use case, seeing a lot of interest in Firecrawl and multiOn’s features related to this; are there any particular differences to those services you’d like to make clear for the readers?

Report

1yr ago

Monkt

Maker

@jalcantara Thanks Jorge. The main difference is our focus on ML/AI document processing rather than general web crawling. We preserve complete document structure and semantic relationships for Markdown exports, support custom JSON schemas, and optimize output for ML pipelines.

Report

1yr ago

Hi @simeon_emanuilov Your product is pretty good fit for what I have been looking for. Do you have any plans to come on AppSumo or offer lifetime deal directly?

Report

1yr ago

Monkt

Maker

@kkofficial Hello Kunal, Yes, but in a few months. I appreciate your interest.

Report

1yr ago

Just try this tool, this tool is not supporting very well for a file with chart now. And I guess you use LLM to process the file transforming, this method has some problems such as the content length limitation

Report

1yr ago

Monkt

Maker

@tibelf Hi Leo, Can you share with me some example problematique cases? I would take a look. You can reach me via X, LinkedIn, or support channels. In short: transfer to MD should work without any troubles. Transforming to JSON could be further improved in my opinion, but for relatively short docs -> should work great. Pre-defined prompts could help doing some more complex operations, like summarization, translation, etc. Thanks.

Report

1yr ago

Monkt

Maker

Due to popular demand, I am sharing an article explaining a bit more about the technical approach, why we need such a tool, and what are the open-source options: https://medium.com/@simeon.emanu... I hope it is helpful!

Report

1yr ago

I tried with my 2 files but I do not like output to be honest, My suggestion will to be to imporve quality of output.

Report

1yr ago

Monkt

Maker

@lakhendra Hey Lakhendra, Can you share with me in a message what you tried and what was not good? We will try to improve it. Just released our roadmap, where a lot of enhancements are planned: https://monkt.com/roadmap/ Thanks!

Report

1yr ago

1 2 3

Reviews