Skip to content

Transcribing Large Audio Files With The STT Large Speech Model (LSM) #83

@lmazzoli

Description

@lmazzoli

When transcribing large speech models using Watson STT new Large Speech Model (LSM), the results are separated based on the End of Phrase Silence time. Each part of the response is returned to the On Data portion of the MyRecognizeCallback class separately. This causes only the last part of the audio transcript to be recorded as the STT transcript.

Please update the code to produce a single, full audio transcript in the output transcriptions file when using the LSM.

@gecock

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions