Deepmark AI

Deepmark AI

LLM benchmarking tool for task-specific metrics on your data

85 followers

Deepmark AI is a benchmarking tool that enables assessment of several large language models (LLM) on various extrinsic (task-specific) metrics (e.g. accuracy, relevance, failure rate, latency, etc) on your own data, so your AI apps have reliable performance.

Deepmark AI makers

Here are the founders, developers, designers and product people who worked on Deepmark AI