This page describes an open source CLI tool offered by Corti, providing a means for consistent, best practice speech to text evaluations. Measure accuracy beyond simple metrics like Word Error Rate (WER)
Identify failures on critical terms (e.g., medical vocabulary)
Compare models, configurations, and prompts reliably
Debug systematic errors and improve product quality
Identify failures on critical terms (e.g., medical vocabulary)
Compare models, configurations, and prompts reliably
Debug systematic errors and improve product quality
A Better Way to Evaluate ASR
Metrics like WER treat all words equally, but in real-world use cases, not all errors carry the same weight. Missing filler words is very different from missing a diagnosis, medication, or key symptom. ErrorAlign is a next-generation alignment and evaluation method designed for modern speech recognition. Instead of forcing transcripts into rigid, one-to-one comparisons, it produces more natural, human-like mappings between reference and model output. This makes error analysis clearer, more reliable, and more actionable.Click here to read the Error Align paper
Produces more human-like transcript comparisons
Improves error attribution
Enables deeper, more actionable analysis
Improves error attribution
Enables deeper, more actionable analysis
Measure and Visualize Performance
Corti Canal is an open source command-line tool that measures and visualizes speech-recognition performance. Input a CSV with the expected reference transcripts alongside the model-generated output. A self-contained HTML report will be produced containing overall accuracy metrics and a word-by-word comparison, so you can see exactly where the model gets things right and where it does not.- One to many files can be evaluated in a single run with metrics (e.g., word error rate, character error rate, and medical term recall) calculated across all rows in the CSV.
- Choose between standard Levenshtein distance or Corti’s Error Align algorithm for evaluation methodology.
- Medical terms are defined based on your preferences: reference a vocabulary library to be used in the evaluation, augment and maintain the vocabulary over time or customize for a given analysis.
- Analysis normalizes output (all lowercase, without punctuation) by default, but an option to disable normalization is available to include checking of casing and punctuation.
- In addition to the metrics, a visualization of the results is included, overlaying generated and final results to see each error and medical term identified in the results.

Click here to view Corti Canal on Github
Built-in normalization
Advanced alignment methods
Visual error analysis
Domain-specific metrics (e.g., medical term recall)
Advanced alignment methods
Visual error analysis
Domain-specific metrics (e.g., medical term recall)
What This Enables
Instead of merely asking:“What is the WER?”You can answer:
“Where is my model failing, and does it matter?”
See some of our supporting research here: