Corti speech recognition and text generation are designed specifically for the healthcare domain. Automated speech recognition (ASR) language models are designed to balance recognition speed, performance, and accuracy. Text generation LLMs accept various inputs depending on the workflow (e.g., transcripts or facts) and have defined guardrails to support quality assurance of fact and document outputs.
The language codes listed below are used in API requests to define output language for speech-to-text and document generation.
  • Learn more about speech recognition workflows here.
  • Learn how to query the API for document templates available by language here.

Speech Recognition Performance Tiers

Corti speech recognition uses a tier system to categorize functionality and performance that is available per language:
| Tier | Description | |
| --- | --- | --- |
| Base | AI-powered speech-to-text capability, ready to integrate with healthcare IT solutions via the /stream or /transcripts API | Up to 1,000 |
| Enhanced | Base plus optimized medical vocabulary for a variety of specialties and support for real-time dictation via the /transcribe API | 1,000-99,999 |
| Premier | Enhanced plus speech recognition models delivering the best performance in terms of accuracy, quality, and latency | 100,000+ |
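
The tier descriptions above are cumulative, which can be sketched as an ordered enum. This is a minimal illustration, not part of the Corti API: the names `Tier` and `endpoints_for_tier` are hypothetical, and the endpoint sets are read directly from the descriptions above, so the per-language availability table remains the authoritative reference.

```python
from enum import IntEnum


class Tier(IntEnum):
    """Hypothetical ordering of the tiers; each tier includes the one below it."""
    BASE = 1
    ENHANCED = 2
    PREMIER = 3


def endpoints_for_tier(tier: Tier) -> set[str]:
    """Return the speech recognition endpoints implied by the tier descriptions."""
    # Base: speech-to-text via /stream or /transcripts.
    endpoints = {"/stream", "/transcripts"}
    # Enhanced and Premier: real-time dictation via /transcribe is added.
    if tier >= Tier.ENHANCED:
        endpoints.add("/transcribe")
    return endpoints


if __name__ == "__main__":
    for tier in Tier:
        print(tier.name, sorted(endpoints_for_tier(tier)))
```

Using an `IntEnum` keeps the "Base plus" and "Enhanced plus" wording as a simple comparison rather than three separate lists.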

Language Availability per Endpoint

The table below summarizes languages supported by the Corti API and how they can be used with speech recognition endpoints (Transcribe, Stream, and Transcripts) and text generation endpoints (Documents):
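| Language | Language Code | Tier |
| --- | --- | --- |
| Arabic | ar | Base |
| Danish | da | Premier [2] |
| Dutch | nl | Enhanced |
| English (US) | en or en-US | Premier [2] |
| English (UK) | en-GB | Premier [2] |
| French | fr | Premier [2] |
| German | de | Premier [2] |
| Hungarian | hu | Enhanced |
| Italian | it | Base |
| Norwegian | no | Enhanced |
| Portuguese | pt | Base |
| Spanish | es | Base |
| Swedish | sv | Enhanced |
| Swiss German | gsw-CH [3] | Enhanced [2] |
| Swiss High German | de-CH [3] | Premier [2] |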
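
For client-side validation, the language list above can be mirrored in a small lookup table. This is a sketch under assumptions: `LANGUAGE_TIERS` and `tier_for_language` are hypothetical names, the values are copied from the list above at the time of writing, and matching is exact and case-sensitive because the source does not say whether the API normalizes codes.

```python
# Language code -> speech recognition tier, copied from the list above.
# "en" and "en-US" are both listed for English (US).
LANGUAGE_TIERS = {
    "ar": "Base",
    "da": "Premier",
    "nl": "Enhanced",
    "en": "Premier",
    "en-US": "Premier",
    "en-GB": "Premier",
    "fr": "Premier",
    "de": "Premier",
    "hu": "Enhanced",
    "it": "Base",
    "no": "Enhanced",
    "pt": "Base",
    "es": "Base",
    "sv": "Enhanced",
    "gsw-CH": "Enhanced",
    "de-CH": "Premier",
}


def tier_for_language(code: str) -> str:
    """Return the tier for a listed language code, or raise ValueError."""
    try:
        return LANGUAGE_TIERS[code]
    except KeyError:
        # Exact, case-sensitive matching is an assumption of this sketch.
        raise ValueError(f"Unsupported language code: {code!r}") from None


if __name__ == "__main__":
    print(tier_for_language("da"))
```

A hard-coded table like this goes stale as languages are added or promoted, so it is best treated as a pre-flight check rather than a source of truth.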

Notes:
[1] Use the language codes listed above for the outputLanguage parameter in POST /documents requests. The template(s) or section(s) must be available in the defined output language for document generation to succeed.
For workflows that use /stream for real-time transcript generation and fact extraction, the outputLanguage for facts must also be supported by speech recognition. Translation from English transcripts to facts in other languages (e.g., German, French, Danish) is generally supported, but other translation language pairs are not quality assessed or performance benchmarked at this time.
[2] Speech recognition accuracy for async audio file processing via the /transcripts endpoint may be degraded compared to real-time recognition via the /transcribe and /stream endpoints. An update to address this performance issue is expected in November 2025.
[3] Use language code gsw-CH when dialectal Swiss German is spoken but Swiss High German is the expected written form (e.g., conversational AI scribing). Use language code de-CH when Swiss High German is both spoken and the expected written form (e.g., dictation).
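
Two of the rules in these notes can be expressed as small helper functions. This is an illustrative sketch only: the function names and the `ASR_LANGUAGE_CODES` set are hypothetical, the set is copied from the language list on this page, and nothing here calls the Corti API.

```python
# Language codes listed on this page as supported by speech recognition.
ASR_LANGUAGE_CODES = {
    "ar", "da", "nl", "en", "en-US", "en-GB", "fr", "de",
    "hu", "it", "no", "pt", "es", "sv", "gsw-CH", "de-CH",
}


def swiss_german_code(spoken_is_dialect: bool) -> str:
    """Pick the Swiss German language code based on what is spoken.

    In both cases the expected written form is Swiss High German.
    """
    # Dialectal Swiss German spoken (e.g., conversational AI scribing): gsw-CH.
    # Swiss High German spoken (e.g., dictation): de-CH.
    return "gsw-CH" if spoken_is_dialect else "de-CH"


def check_facts_output_language(code: str) -> None:
    """Raise ValueError if a facts outputLanguage is not a listed ASR language.

    Mirrors the /stream rule that the facts output language must also be
    supported by speech recognition.
    """
    if code not in ASR_LANGUAGE_CODES:
        raise ValueError(f"outputLanguage {code!r} is not supported by speech recognition")


if __name__ == "__main__":
    code = swiss_german_code(spoken_is_dialect=True)
    check_facts_output_language(code)
    print(code)
```

Passing this check does not imply that a given translation pair has been quality assessed; the notes above still apply.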

Please contact us if you are interested in a language that is not listed here, need help with tiers and endpoint definitions, or have questions about how to use language codes in API requests.