MindTrial

MindTrial

Puts AI models to the test.

2 followers

Test a single AI language model (LLM) or evaluate multiple models side-by-side. MindTrial supports providers like OpenAI, Google, Anthropic, DeepSeek, Mistral AI, xAI, and Alibaba. You can create your own custom tasks with text prompts, plain text or structured JSON response formats, optional file attachments, and tool use for enhanced capabilities; validate responses through exact value matching or an LLM judge for semantic evaluation; and get results in easy-to-read HTML and CSV formats.

MindTrial makers

Here are the founders, developers, designers and product people who worked on MindTrial