Margarita Shvetsova

Alconost MQM Annotation Tool - Annotate translation errors, score quality, get PDF reports

Free MQM annotation tool for evaluating translation quality. Mark errors by category and severity, set custom weights, and export as CSV, TSV, JSON, or PDF. No installation and no signup required.

Add a comment

Replies

Best
Margarita Shvetsova
Hi Product Hunt! 👋 MQM (Multidimensional Quality Metrics) is the gold standard for evaluating translation quality — used by WMT benchmarks, adopted by major localization teams. But actually using it? Your options were clunky research tools, expensive enterprise software, or spreadsheets you'd rather not open. So we built the MQM tool we wished existed: professional, browser-based, and free. What you can do with it: ✔️ Evaluate any translation (human or AI) against the full MQM error taxonomy ✔️ Set custom error weights and pass/fail thresholds for your projects ✔️ Add context for annotators: glossary terms, style guides, reference links ✔️ Get quality scores with error distribution charts ✔️ Export data as CSV, TSV, JSON, or PDF reports Who it's for: ⭐ Localization Managers evaluating translation quality and benchmarking MT output ⭐ ML Researchers and Engineers creating gold-standard datasets for quality evaluation models ⭐ Language Service Providers and Linguists offering LQA aligned with MQM standards The tool itself is free, but you need professional linguists to do the annotation work. You can use your own team or ask us. We'll be happy to hear your feedback and questions!
Julia Zakharova

@margarita_s88 Good luck in the race!

Margarita Shvetsova

@julia_zakharova2 Thanks very much, Julia!

Mark Lippert

Hi, I am interested in understanding translation quality for our product, but I haven’t heard of MQM before. When I check translations with other translators, I just need to know whether there are any mistakes or not, right? What’s the deal with MQM?

Nick Zaleski

@lippert thank you for this question! sometimes you just need to know the mistakes. but MQM helps when you need to compare quality objectively, for example:

- Does your localization vendor meet your quality threshold?
- Which MT engine performs best for your content?
- Is a translation with 4 minor errors better or worse than one with 1 major error?
- MQM gives each error a weight based on type and severity, then calculates an overall score.

This turns subjective 'good enough' into measurable 'passed with 98.2'

Nataliya

@lippert Your translator marks a mistake, you ask "What's wrong with this translation?" They anwer: "It doesn't sound natural." or "It feels awkward." These are subjective answers. MQM says what exactly is wrong and in which category (Accuracy/Mistranslation; Fluency/Spelling etc.) + gives severities to these issues and overall score. Now you have an objective answer that you can measure and compare.

Valeriia Avramenko

Is it for machine translation quality evaluation?

Margarita Shvetsova

@valeriiavramenko Thanks for your question, Valeriia! You can use it for annotating both machine translation and human translation.

Julia Zakharova

Hi! Congratulations to the team on launching on PH! 🚀🚀

How is the final score calculated? 🧐

Margarita Shvetsova

@julia_zakharova2 thank you! ^__^

Regarding the final score: each error gets a penalty based on its severity (minor, major, critical) multiplied by configurable weights. The tool calculates penalties per segment and for the whole document, then compares against your pass/fail threshold. You can adjust all the weights to match your quality requirements.

It may sound a bit complicated, but hopefully I've managed to explain it :)

David Shakhbazyan

We're evaluating MT engines for our app. Would this help us compare outputs?

Nick Zaleski

@dshakhbazyan yerrrr, that's one of the main use cases. run the same source content through different MT engines, annotate the outputs using the same criteria, and compare the scores. the reports show error breakdown by category so you can see which engine struggles with terminology vs. fluency, for example

David Shakhbazyan
@nickzaleski thanks, sounds awesome!
Diana Ivanenko

Good luck with the launch, team!

Margarita Shvetsova

@dioiv thank you! :)

Anna Kuznetsova

I’ve played with it a bit and I see the uploaded samples already contain marked errors. Is this some kind of AI that does preliminary annotation job?

Nick Zaleski

@anna_kuznetsova12 No AI involved here.

We uploaded samples that were already annotated by human linguists to show you what marked errors look like. We wanted to give you a quick way to explore the interface without starting from scratch.

Alexandr Shvetsov

Cool, congrats on the launch! Do you plan to support more file formats?

Nick Zaleski

@alex_sh78 actually we chose these formats as they are all you need to do evaluation/LQA kind of tasks.

What file formats did you have in mind?

Annet

Are there any limitations on usage or is the tool totally free?

Margarita Shvetsova

@annet_sukhareva It really is free, and will stay free :) you only need to pay for the work of annotators, be it Alconost’s professional annotators or your in-house team.

Liza Diagel

Interesting tool. You mentioned one can do LQA with it - but LQA is different from just annotating/marking errors.

Nick Zaleski

@liza_diagel that's right, the tool's primary purpose is translation evaluation. but there's also a field for correcting translations, so you can use it for full LQA workflows, not just error marking

12
Next
Last