Introduction

The Real-Time Ambient Documentation Workflow is designed for scenarios where immediate processing and feedback are essential during live interactions between medical practitioners and patients. It uses WebSocket connections for continuous audio streaming, real-time transcription, and dynamic updates throughout the interaction, making it ideal for situations where clinicians need to capture and act on information instantly, with all relevant data processed and available in real time.

Endpoints and capabilities

| Endpoint | Capability | Use |
| --- | --- | --- |
| Interactions | The foundational unit that ties together all related data and operations, enabling a cohesive workflow from the start of the interaction to the generation of final documentation. | Required |
| Streams | Real-time audio streaming, conversational transcript, and fact extraction. | Required |
| Facts | Used by the /streams endpoint for fact extraction; can also be called to get, update, or add facts. | Optional |
| Documents | Ambient note generation based on the template and language defined in the request. | Required |
| Templates | Get details for available templates that define the structure and language of generated documents. | Optional |

Workflow

Step 1: Initialization

  1. The workflow begins with the client application initiating an interaction. This is done by sending a POST request to the /interactions endpoint.
  2. The API responds with a unique interactionId and a WebSocket URL (wssUrl). The identifier will be used to manage the subsequent steps of the workflow.
  3. Optional Pre-Interaction Data: If the client has pre-existing information (known Facts) about the patient or session, these can be sent to /interactions/:id/facts before starting the live interaction.
  4. Start the interaction: The client should send a PATCH request to /interactions/:id, changing the interaction state to “in-progress” (as sketched below).
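
The TypeScript sketch below illustrates these initialization calls. The base URL, bearer-token authentication, and request body fields other than interactionId and wssUrl are placeholders for illustration; only the endpoints and the state change come from the steps above.

```typescript
// Initialization sketch: create an interaction, optionally seed facts,
// then mark it "in-progress". BASE_URL, TOKEN, and the fact payload shape
// are placeholders, not the exact API contract.
const BASE_URL = "https://api.example.com"; // placeholder
const TOKEN = "<your-api-token>";           // placeholder

async function api(path: string, method: string, body?: unknown) {
  const res = await fetch(`${BASE_URL}${path}`, {
    method,
    headers: {
      "Authorization": `Bearer ${TOKEN}`,
      "Content-Type": "application/json",
    },
    body: body ? JSON.stringify(body) : undefined,
  });
  if (!res.ok) throw new Error(`${method} ${path} failed: ${res.status}`);
  return res.json();
}

async function initializeInteraction() {
  // Steps 1-2: create the interaction and receive interactionId + wssUrl.
  const { interactionId, wssUrl } = await api("/interactions", "POST", {});

  // Step 3 (optional): seed pre-existing facts before going live.
  await api(`/interactions/${interactionId}/facts`, "POST", {
    facts: [{ text: "Known penicillin allergy" }], // illustrative fact shape
  });

  // Step 4: move the interaction into the "in-progress" state.
  await api(`/interactions/${interactionId}`, "PATCH", { status: "in-progress" });

  return { interactionId, wssUrl };
}
```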
Step 2: Live Interaction

  1. Opening the WebSocket Connection: The client establishes a real-time connection by opening a WebSocket using the provided wssUrl. This connection serves as the primary channel for transmitting and receiving data throughout the interaction.
  2. Streaming and Configuration: Upon connection, the client sends configuration details to tailor the interaction. The configuration is committed via the WebSocket, and the API acknowledges it back to the client.
  3. Continuous Data Flow: As the interaction progresses, the client sends audio packets through the WebSocket. In return, the API streams back transcripts at 3 to 20 second intervals and automatically generated Facts at 60 second intervals.
  4. During the interaction, the client may need to introduce new facts based on the ongoing conversation. These are sent to /interactions/:id/facts, adding to the current interaction data (see the sketch below).
See details for /streams requests and configuration options here
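
The following sketch shows one way the live-interaction loop could look in TypeScript, assuming a standard WebSocket client. The message shapes ("config", "transcript", "facts") and the configuration fields are illustrative assumptions; the actual schema is defined in the /streams documentation linked above.

```typescript
// Live-interaction sketch: open the WebSocket, send a configuration
// message, stream audio chunks, and handle transcript/fact messages.
// Message types and payload shapes here are assumptions, not the
// documented protocol.
function startLiveSession(wssUrl: string, audioChunks: AsyncIterable<ArrayBuffer>) {
  const socket = new WebSocket(wssUrl);

  socket.onopen = async () => {
    // Send configuration first; the server acknowledges it back.
    socket.send(JSON.stringify({
      type: "config",
      configuration: { primaryLanguage: "en" }, // placeholder settings
    }));

    // Stream audio packets for the duration of the interaction.
    for await (const chunk of audioChunks) {
      socket.send(chunk);
    }
  };

  socket.onmessage = (event) => {
    if (typeof event.data !== "string") return;
    const message = JSON.parse(event.data);
    switch (message.type) {
      case "transcript": // streamed at 3 to 20 second intervals
        console.log("Transcript update:", message);
        break;
      case "facts":      // streamed at roughly 60 second intervals
        console.log("Extracted facts:", message);
        break;
      default:
        console.log("Other message:", message);
    }
  };

  return socket;
}
```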
Step 3: Document generation

  1. Select the facts and/or transcript that should be used for note generation.
  2. Generate documents by sending a POST request to /interactions/:id/documents. This request should include the selected facts and/or transcript generated in the real-time stream. A templateKey and language are defined to specify the format of the output document.
  3. The API processes the request and returns a 200 status along with the final document, which is ready for use or further editing.
  4. Submit multiple POST requests to the /documents endpoint with different templateKey values to generate multiple notes for a given interaction (see the sketch below).
See details on Templates and how to use them here
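
A minimal TypeScript sketch of document generation follows. The endpoint, templateKey, and language parameters come from the steps above, while the request body field names and the example template keys are assumptions.

```typescript
// Document generation sketch: request one note per templateKey.
// The body shape (facts, transcript, templateKey, language) mirrors the
// step list above, but exact field names and template keys are assumptions.
async function generateDocuments(
  baseUrl: string,
  token: string,
  interactionId: string,
  facts: unknown[],
  transcript: unknown[],
) {
  const templateKeys = ["soap-note", "discharge-summary"]; // placeholder keys
  const documents = [];

  for (const templateKey of templateKeys) {
    const res = await fetch(`${baseUrl}/interactions/${interactionId}/documents`, {
      method: "POST",
      headers: {
        "Authorization": `Bearer ${token}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({ facts, transcript, templateKey, language: "en" }),
    });
    if (!res.ok) throw new Error(`Document generation failed: ${res.status}`);
    documents.push(await res.json()); // the 200 response carries the final document
  }

  return documents;
}
```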
Step 4: Ending the Interaction

  1. Once the live interaction is complete, the client sends an "end" message via the WebSocket, signalling the conclusion of the session.
  2. The WebSocket then sends any final updates and the "ended" message, formally closing the live interaction.
  3. After the live session, the client can review and make final edits to the facts collected during the interaction.
  4. These finalized Facts are then submitted through a POST request to /interactions/:id/documents, which generates the desired documentation, such as clinical notes or summaries, based on the provided data and a template.
  5. The API processes this request and returns the completed document, ready for use or further review.
  6. The client should send a PATCH request to /interactions/:id, changing the interaction state to “completed” (see the sketch below).
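
The sketch below outlines how a client might wind down the session, assuming a JSON "end"/"ended" message exchange on the WebSocket (as described above) and a status field on the PATCH body; the exact message and field names are assumptions.

```typescript
// End-of-interaction sketch: signal "end" over the WebSocket, wait for the
// "ended" confirmation, then mark the interaction "completed" via PATCH.
// The "end"/"ended" message shapes and the status field name are assumptions.
function endLiveSession(socket: WebSocket): Promise<void> {
  return new Promise((resolve) => {
    socket.addEventListener("message", (event) => {
      if (typeof event.data !== "string") return;
      const message = JSON.parse(event.data);
      if (message.type === "ended") {
        socket.close();  // final updates have been delivered
        resolve();
      }
    });
    socket.send(JSON.stringify({ type: "end" }));
  });
}

async function completeInteraction(baseUrl: string, token: string, interactionId: string) {
  const res = await fetch(`${baseUrl}/interactions/${interactionId}`, {
    method: "PATCH",
    headers: {
      "Authorization": `Bearer ${token}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ status: "completed" }),
  });
  if (!res.ok) throw new Error(`PATCH /interactions/:id failed: ${res.status}`);
}
```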