8 changes: 8 additions & 0 deletions chapters/audio-intelligence/index.mdx
@@ -32,6 +32,14 @@ Use these capabilities alongside Live or Pre-recorded STT to automate workflows
Detect and categorize key entities like people, organizations, dates, and more.
</Card>

<Card
title="PII Redaction"
icon="shield-halved"
href="/chapters/audio-intelligence/pii-redaction"
>
Automatically redact names, emails, vehicle IDs, and other PII in pre-recorded transcripts.
</Card>

<Card
title="Chapterization"
icon="bookmark"
63 changes: 63 additions & 0 deletions chapters/audio-intelligence/pii-redaction.mdx
@@ -0,0 +1,63 @@
---
title: PII Redaction
description: "Automatically detect and redact personally identifiable information in pre-recorded transcripts"
---

**PII Redaction** automatically detects and replaces sensitive entities (names, emails, addresses, etc.) in your transcript output. This feature is only available for **pre-recorded** transcription.

## Usage

Add `"pii_redaction": true` to your request to redact all detected PII in the transcript. By default, sensitive entities are replaced with markers such as `[NAME_1]` in the output.

<CodeGroup>
```json Pre-recorded
{
  "audio_url": "YOUR_AUDIO_URL",
  "pii_redaction": true
}
```
</CodeGroup>

## Optional configuration

You can customize the behavior with `pii_redaction_config`:

<ParamField body="entity_types" type="string[]">
Preset or list of PII entity types to redact (e.g. `["GDPR"]`).
See [Named Entity Recognition](/chapters/audio-intelligence/named-entity-recognition#supported-entities) for supported entity types.
</ParamField>
<ParamField body="processed_text_type" type="enum" default="MARKER">
How to replace detected PII:
- **`MARKER`**: Placeholder labels like `[NAME_1]`, `[EMAIL_1]`. Every mention of the same entity receives the same ID.
- **`MASK`**: Each character replaced by a mask (e.g. "John Smith" → `#### #####`)
</ParamField>
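
To make the two replacement modes concrete, here is a small local Python sketch that mimics the output formats described above on entities that are already known. It is an illustration only, not how the API detects or redacts PII: the `redact` helper and its `entities` argument are hypothetical, and it assumes that `MASK` leaves whitespace untouched, as in the "John Smith" example.

```python
def redact(text, entities, mode="MARKER"):
    """Replace known entity strings in `text` (hypothetical helper).

    `entities` is a list of (surface_text, label) pairs,
    e.g. ("John Smith", "NAME"). Real detection happens server-side.
    """
    replacements = {}  # surface_text -> replacement, so repeats share one ID
    counters = {}      # label -> last index handed out

    for surface, label in entities:
        if surface in replacements:
            continue
        if mode == "MASK":
            # Assumption: every non-whitespace character becomes '#'.
            replacements[surface] = "".join(
                c if c.isspace() else "#" for c in surface
            )
        else:
            counters[label] = counters.get(label, 0) + 1
            replacements[surface] = f"[{label}_{counters[label]}]"

    for surface, replacement in replacements.items():
        text = text.replace(surface, replacement)
    return text


sample = "John Smith placed it. Ask John Smith."
print(redact(sample, [("John Smith", "NAME")]))          # [NAME_1] placed it. Ask [NAME_1].
print(redact(sample, [("John Smith", "NAME")], "MASK"))  # #### ##### placed it. Ask #### #####.
```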

## Example body

```json
{
  "audio_url": "YOUR_AUDIO_URL",
  "pii_redaction": true,
  "pii_redaction_config": {
    "entity_types": ["GDPR"],
    "processed_text_type": "MARKER"
  }
}
```
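
If you assemble the request body in code, a small builder can keep the optional configuration consistent. The following Python sketch only constructs the JSON payload shown above; the function name `build_pii_request` is hypothetical, the client-side check on `processed_text_type` is an added convenience rather than documented API behavior, and sending the request (endpoint, authentication) is deliberately left out.

```python
import json

# The two replacement modes documented for `processed_text_type`.
ALLOWED_TEXT_TYPES = {"MARKER", "MASK"}


def build_pii_request(audio_url, entity_types=None, processed_text_type=None):
    """Return a request body dict with PII redaction enabled (hypothetical helper)."""
    body = {"audio_url": audio_url, "pii_redaction": True}

    config = {}
    if entity_types is not None:
        config["entity_types"] = list(entity_types)
    if processed_text_type is not None:
        if processed_text_type not in ALLOWED_TEXT_TYPES:
            raise ValueError(
                f"processed_text_type must be one of {sorted(ALLOWED_TEXT_TYPES)}"
            )
        config["processed_text_type"] = processed_text_type

    # Only attach the config object when at least one option was set.
    if config:
        body["pii_redaction_config"] = config
    return body


print(json.dumps(build_pii_request("YOUR_AUDIO_URL", ["GDPR"], "MARKER"), indent=2))
```

Calling `build_pii_request("YOUR_AUDIO_URL")` with no options yields the minimal body from the Usage section, so the API defaults apply.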

## Example output

**Without PII redaction (raw transcript):**

> Hi, I'm calling about the order for John Smith. Can you confirm the delivery to john.smith@company.com? Yes, John Smith placed it yesterday.

**With PII redaction (`processed_text_type="MASK"`):**

> Hi, I'm calling about the order for #### #####. Can you confirm the delivery to ######################? Yes, #### ##### placed it yesterday.

**With PII redaction (`processed_text_type="MARKER"`):**

> Hi, I'm calling about the order for [NAME_1]. Can you confirm the delivery to [EMAIL_1]? Yes, [NAME_1] placed it yesterday.

The same entity mentioned multiple times receives the **same marker ID** (e.g. "John Smith" becomes `[NAME_1]` both times), so you can track references across the transcript while keeping sensitive data redacted. \
This consistency is also useful for downstream tasks using LLMs, which can reason about entities (e.g. "the person referred to as `[NAME_1]`") without ever seeing the raw PII.
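
As a sketch of such downstream processing, the Python snippet below counts how often each marker appears in a redacted transcript. It assumes markers always follow the `[LABEL_N]` shape seen in the examples (uppercase label, underscore, number); the regular expression and the `count_markers` helper are illustrative, not part of the API.

```python
import re
from collections import Counter

# Assumed marker shape, inferred from the examples: [NAME_1], [EMAIL_1], ...
MARKER_RE = re.compile(r"\[[A-Z_]+_\d+\]")


def count_markers(transcript):
    """Count occurrences of each redaction marker in a transcript."""
    return Counter(m.group(0) for m in MARKER_RE.finditer(transcript))


redacted = (
    "Hi, I'm calling about the order for [NAME_1]. Can you confirm the "
    "delivery to [EMAIL_1]? Yes, [NAME_1] placed it yesterday."
)
counts = count_markers(redacted)
print(counts["[NAME_1]"])   # 2
print(counts["[EMAIL_1]"])  # 1
```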
7 changes: 7 additions & 0 deletions chapters/pre-recorded-stt/features/index.mdx
@@ -13,6 +13,13 @@ The core functionality of the Gladia API is its Speech Recognition model, design
>
Detect speakers and understand who said what, and when.
</Card>
<Card
title="PII Redaction"
icon="shield-halved"
href="/chapters/audio-intelligence/pii-redaction"
>
Automatically redact names, emails, vehicle IDs, and other PII in pre-recorded transcripts.
</Card>
<Card
title="Export subtitles (SRT/VTT)"
icon="code"
1 change: 1 addition & 0 deletions docs.json
@@ -95,6 +95,7 @@
"chapters/audio-intelligence/translation",
"chapters/audio-intelligence/summarization",
"chapters/audio-intelligence/named-entity-recognition",
"chapters/audio-intelligence/pii-redaction",
"chapters/audio-intelligence/sentiment-analysis",
"chapters/audio-intelligence/chapterization",
"chapters/audio-intelligence/audio-to-llm"