All activity
Evaluate and compare AI language models (LLMs) on text-based tasks with optional file/image attachments. Supports multiple providers (OpenAI, Google, Anthropic, DeepSeek), custom tasks in YAML, and HTML/CSV reports.
MindTrialPuts AI models to the test
Petr Malikleft a comment
I’m happy to share the MindTrial project I worked on. MindTrial is a tool for evaluating and comparing AI language models on text-based tasks with optional file/image attachments. Compare multiple AI models side by side (OpenAI, Google, Anthropic, DeepSeek). Create custom test tasks using simple YAML files. Attach files or images to prompts for visual tasks. Fine-tune model behavior with...
MindTrialPuts AI models to the test
