
Deepmark AI
LLM benchmarking tool for task-specific metrics on your data
85 followers
LLM benchmarking tool for task-specific metrics on your data
85 followers
Deepmark AI is a benchmarking tool that enables assessment of several large language models (LLM) on various extrinsic (task-specific) metrics (e.g. accuracy, relevance, failure rate, latency, etc) on your own data, so your AI apps have reliable performance.
Deepmark AI launches
Launch date

Deepmark AI LLM benchmarking tool for task-specific metrics on your data
Launched on November 22nd, 2023
