Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
47 changes: 47 additions & 0 deletions chapters/live-stt/features/endpointing.mdx
Original file line number Diff line number Diff line change
@@ -0,0 +1,47 @@
---
title: "Endpointing"

Check warning on line 2 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L2

Did you really mean 'Endpointing'?
description: "What's endpointing and how it works"

Check warning on line 3 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L3

Did you really mean 'endpointing'?
---

Endpointing is the mechanism Gladia uses in live transcription to decide when a speaker has "finished" an utterance, so the API can close that utterance and emit a final transcript segment.

Check warning on line 6 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L6

Did you really mean 'Endpointing'?

Check warning on line 6 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L6

Did you really mean 'Gladia'?

In practice, endpointing answers the question: "How much silence should we wait before we consider the sentence (or turn) complete?"

Check warning on line 8 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L8

Did you really mean 'endpointing'?

### Why endpointing matters

Check warning on line 10 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L10

Did you really mean 'endpointing'?

Endpointing is one of the main knobs that controls the tradeoff between:

Check warning on line 12 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L12

Did you really mean 'Endpointing'?
- **Latency (speed)**: how quickly you get final utterances
- **Completeness**: whether you avoid cutting someone off mid-thought
- **Chunking quality**: whether utterances align well with natural turns or sentences

Lower endpointing values feel "snappier" (great for voice agents), while higher values tend to produce cleaner, more complete segments (great for meetings and lectures).

Check warning on line 17 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L17

Did you really mean 'endpointing'?

### How it works conceptually

During a live session, Gladia continuously analyzes the incoming audio stream and:

Check warning on line 21 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L21

Did you really mean 'Gladia'?
1. Detects speech activity on each channel (voice activity detection)
2. Groups speech into an "utterance" while speech is ongoing
3. When it observes silence lasting at least endpointing seconds, it considers the utterance finished and closes it (finalizes it).

Check warning on line 24 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L24

Did you really mean 'endpointing'?
4. The AI model is then used to transcribe the final result of the utterance.
5. If speech never pauses long enough, Gladia still has a safety mechanism to close the utterance (*maximum_duration_without_endpointing*, see next section)

Check warning on line 26 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L26

Did you really mean 'Gladia'?

Check warning on line 26 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L26

Did you really mean 'maximum_duration_without_endpointing'?

You can also subscribe to speech activity messages to know when speech [starts](https://docs.gladia.io/api-reference/v2/live/callback/speech-start) and [ends](https://docs.gladia.io/api-reference/v2/live/callback/speech-end) (useful to drive UI or agent turn-taking)

### The 2 key parameters

**endpointing (seconds)** \

Check warning on line 32 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L32

Did you really mean 'endpointing'?
Definition: the duration of silence that closes the current utterance.
- Default: 0.05
- Range: 0.01 to 10

Effect:
- Smaller value = closes utterances faster, but can split sentences if the speaker hesitates briefly.
- Larger value = waits longer before finalizing, which improves segment completeness but increases latency.

**maximum_duration_without_endpointing (seconds)**

Check warning on line 41 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L41

Did you really mean 'maximum_duration_without_endpointing'?

Definition: maximum amount of time Gladia will keep an utterance open without detecting endpointing silence. If that limit is reached, the utterance is considered finished anyway.

Check warning on line 43 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L43

Did you really mean 'Gladia'?

Check warning on line 43 in chapters/live-stt/features/endpointing.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/endpointing.mdx#L43

Did you really mean 'endpointing'?
- Default: 5
- Range: 5 to 60

Why it exists: it prevents extremely long, never-ending utterances (for example: constant background noise, a speaker who never pauses, or long monologues), which is important for downstream UX and processing stability.
8 changes: 8 additions & 0 deletions chapters/live-stt/features/index.mdx
Original file line number Diff line number Diff line change
@@ -1,12 +1,20 @@
---
title: "Live transcription Features"
description: "Core features of Gladia's real-time speech-to-text (STT) API"

Check warning on line 3 in chapters/live-stt/features/index.mdx

View check run for this annotation

Mintlify / Mintlify Validation (gladia-95) - vale-spellcheck

chapters/live-stt/features/index.mdx#L3

Did you really mean 'Gladia's'?
---

<Note>All the configuration properties described in the feature pages are defined in the [POST /v2/live endpoint](/api-reference/v2/live/init).</Note>

<CardGroup cols={3}>

<Card
title="Endpointing"
icon="clock"
href="/chapters/live-stt/features/endpointing"
>
Control how long to wait for silence before closing an utterance.
</Card>

<Card
title="Partial transcripts"
icon="list-check"
Expand Down
1 change: 1 addition & 0 deletions docs.json
Original file line number Diff line number Diff line change
Expand Up @@ -41,6 +41,7 @@
"pages": [
"chapters/live-stt/quickstart",
"chapters/live-stt/audio-intelligence",
"chapters/live-stt/features/endpointing",
"chapters/live-stt/features/partial-transcripts",
"chapters/live-stt/features/multiple-channels",
"chapters/live-stt/features/custom-vocabulary",
Expand Down