Petr Malik

Petr Malik

Independent developer.
All activity
Evaluate and compare AI language models (LLMs) on text-based tasks with optional file/image attachments. Supports multiple providers (OpenAI, Google, Anthropic, DeepSeek), custom tasks in YAML, and HTML/CSV reports.
MindTrial
MindTrialPuts AI models to the test