Alconost MQM Annotation Tool - Annotate translation errors, score quality, get PDF reports

Alconost Localization

•4mo ago

Free MQM annotation tool for evaluating translation quality. Mark errors by category and severity, set custom weights, and export as CSV, TSV, JSON, or PDF. No installation and no signup required.

Replies

Best

Alconost Localization

Hunter

📌

Hi Product Hunt! 👋 MQM (Multidimensional Quality Metrics) is the gold standard for evaluating translation quality — used by WMT benchmarks, adopted by major localization teams. But actually using it? Your options were clunky research tools, expensive enterprise software, or spreadsheets you'd rather not open. So we built the MQM tool we wished existed: professional, browser-based, and free. What you can do with it: ✔️ Evaluate any translation (human or AI) against the full MQM error taxonomy ✔️ Set custom error weights and pass/fail thresholds for your projects ✔️ Add context for annotators: glossary terms, style guides, reference links ✔️ Get quality scores with error distribution charts ✔️ Export data as CSV, TSV, JSON, or PDF reports Who it's for: ⭐ Localization Managers evaluating translation quality and benchmarking MT output ⭐ ML Researchers and Engineers creating gold-standard datasets for quality evaluation models ⭐ Language Service Providers and Linguists offering LQA aligned with MQM standards The tool itself is free, but you need professional linguists to do the annotation work. You can use your own team or ask us. We'll be happy to hear your feedback and questions!

Report

4mo ago

@margarita_s88 Good luck in the race!

Report

4mo ago

Alconost Localization

Hunter

@julia_zakharova2 Thanks very much, Julia!

Report

4mo ago

Hi, I am interested in understanding translation quality for our product, but I haven’t heard of MQM before. When I check translations with other translators, I just need to know whether there are any mistakes or not, right? What’s the deal with MQM?

Report

4mo ago

Maker

@lippert thank you for this question! sometimes you just need to know the mistakes. but MQM helps when you need to compare quality objectively, for example:

- Does your localization vendor meet your quality threshold?
- Which MT engine performs best for your content?
- Is a translation with 4 minor errors better or worse than one with 1 major error?
- MQM gives each error a weight based on type and severity, then calculates an overall score.

This turns subjective 'good enough' into measurable 'passed with 98.2'

Report

4mo ago

@lippert Your translator marks a mistake, you ask "What's wrong with this translation?" They anwer: "It doesn't sound natural." or "It feels awkward." These are subjective answers. MQM says what exactly is wrong and in which category (Accuracy/Mistranslation; Fluency/Spelling etc.) + gives severities to these issues and overall score. Now you have an objective answer that you can measure and compare.

Report

4mo ago

Emma Intelligence

Is it for machine translation quality evaluation?

Report

4mo ago

Alconost Localization

Hunter

@valeriiavramenko Thanks for your question, Valeriia! You can use it for annotating both machine translation and human translation.

Report

4mo ago

Hi! Congratulations to the team on launching on PH! 🚀🚀

How is the final score calculated? 🧐

Report

4mo ago

Alconost Localization

Hunter

@julia_zakharova2 thank you! ^__^

Regarding the final score: each error gets a penalty based on its severity (minor, major, critical) multiplied by configurable weights. The tool calculates penalties per segment and for the whole document, then compares against your pass/fail threshold. You can adjust all the weights to match your quality requirements.

It may sound a bit complicated, but hopefully I've managed to explain it :)

Report

4mo ago

Glam AI

We're evaluating MT engines for our app. Would this help us compare outputs?

Report

4mo ago

Maker

@dshakhbazyan yerrrr, that's one of the main use cases. run the same source content through different MT engines, annotate the outputs using the same criteria, and compare the scores. the reports show error breakdown by category so you can see which engine struggles with terminology vs. fluency, for example

Report

4mo ago

Glam AI

@nickzaleski thanks, sounds awesome!

Report

4mo ago

Alconost Localization

Good luck with the launch, team!

Report

4mo ago

Alconost Localization

Hunter

@dioiv thank you! :)

Report

4mo ago

I’ve played with it a bit and I see the uploaded samples already contain marked errors. Is this some kind of AI that does preliminary annotation job?

Report

4mo ago

Maker

@anna_kuznetsova12 No AI involved here.

We uploaded samples that were already annotated by human linguists to show you what marked errors look like. We wanted to give you a quick way to explore the interface without starting from scratch.

Report

4mo ago

Cool, congrats on the launch! Do you plan to support more file formats?

Report

4mo ago

Maker

@alex_sh78 actually we chose these formats as they are all you need to do evaluation/LQA kind of tasks.

What file formats did you have in mind?

Report

4mo ago

Loki.Build

Are there any limitations on usage or is the tool totally free?

Report

4mo ago

Alconost Localization

Hunter

@annet_sukhareva It really is free, and will stay free :) you only need to pay for the work of annotators, be it Alconost’s professional annotators or your in-house team.

Report

4mo ago

Interesting tool. You mentioned one can do LQA with it - but LQA is different from just annotating/marking errors.

Report

4mo ago

Maker

@liza_diagel that's right, the tool's primary purpose is translation evaluation. but there's also a field for correcting translations, so you can use it for full LQA workflows, not just error marking

Report

4mo ago

1 2