Connect to Diarize to transcribe and diarize audio and video content from YouTube, X, Instagram, and TikTok. Submit transcription jobs and retrieve results in JSON, TXT, SRT, or VTT format.

Supports authentication: Bearer Token

Submit a new transcription and diarization job for an audio or video URL (YouTube, X, Instagram, TikTok). Returns a job ID that can be used to check status and download results.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| language | string | No | Language code for transcription (e.g. ‘en’, ‘es’, ‘fr’). Defaults to auto-detection if not provided. |
| num_speakers | integer | No | Expected number of speakers in the audio. Helps improve diarization accuracy. |
| schema_version | string | No | Optional schema version to use for tool execution |
| tool_version | string | No | Optional tool version to use for execution |
| url | string | Yes | The URL of the audio or video content to transcribe (e.g. YouTube, X, Instagram, TikTok link) |
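
As a rough illustration of the parameters above, the following sketch assembles and validates the arguments for a submission. The parameter names (`url`, `language`, `num_speakers`) come from the table; the helper name `build_submit_args` and its validation rules are assumptions made for this example, and how the resulting dictionary is passed to the connector depends on your client and is not shown.

```python
from typing import Any, Dict, Optional
from urllib.parse import urlparse


def build_submit_args(
    url: str,
    language: Optional[str] = None,
    num_speakers: Optional[int] = None,
) -> Dict[str, Any]:
    """Build the argument dictionary for a transcription job (hypothetical helper)."""
    parsed = urlparse(url)
    # Assumption: only absolute http(s) links are meaningful for hosted media.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"url must be an absolute http(s) URL, got {url!r}")
    if num_speakers is not None and num_speakers < 1:
        raise ValueError("num_speakers must be a positive integer")

    args: Dict[str, Any] = {"url": url}
    # Optional fields are omitted entirely so the documented defaults apply
    # (for example, language auto-detection).
    if language is not None:
        args["language"] = language
    if num_speakers is not None:
        args["num_speakers"] = num_speakers
    return args
```

Leaving `language` out, rather than sending an empty value, keeps the auto-detection default described in the table.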

Download the transcript output for a completed transcription job in JSON, TXT, SRT, or VTT format, including speaker diarization, segments, and word-level timestamps.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| format | string | No | Output format for the transcript. Supported formats: ‘json’, ‘txt’, ‘srt’, ‘vtt’. |
| job_id | string | Yes | The unique ID of the completed transcription job |
| schema_version | string | No | Optional schema version to use for tool execution |
| tool_version | string | No | Optional tool version to use for execution |
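
The sketch below shows one way to check the `format` argument against the four supported values before requesting a download. The helper name `build_download_args` is hypothetical, and lowercasing the input is an assumption of this example; the page does not state which format is used when `format` is omitted, so the sketch simply leaves it out in that case.

```python
from typing import Dict, Optional

# The four output formats listed in the parameter table.
SUPPORTED_FORMATS = ("json", "txt", "srt", "vtt")


def build_download_args(job_id: str, fmt: Optional[str] = None) -> Dict[str, str]:
    """Build the argument dictionary for a transcript download (hypothetical helper)."""
    if not job_id:
        raise ValueError("job_id is required")
    args = {"job_id": job_id}
    if fmt is not None:
        # Assumption: normalize case locally so "SRT" is sent as "srt".
        normalized = fmt.strip().lower()
        if normalized not in SUPPORTED_FORMATS:
            raise ValueError(
                f"unsupported format {fmt!r}; expected one of {SUPPORTED_FORMATS}"
            )
        args["format"] = normalized
    return args
```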

Retrieve the current status of a transcription job by its job ID. Returns the job state (pending, processing, completed, or failed), metadata, and an estimatedTime field (in seconds) indicating how long processing is expected to take. Use estimatedTime to set both the polling frequency and the maximum wait. For example, a 49-minute episode may have an estimatedTime of about 891 s (roughly 15 minutes), so the agent should wait at least that long before giving up.

| Name | Type | Required | Description |
| --- | --- | --- | --- |
| job_id | string | Yes | The unique ID of the transcription job to check |
| schema_version | string | No | Optional schema version to use for tool execution |
| tool_version | string | No | Optional tool version to use for execution |
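
The polling guidance above can be sketched as a small loop that derives both its interval and its deadline from `estimatedTime`. Several things here are assumptions rather than documented behavior: `get_status` is a caller-supplied function wrapping the status tool, the job state is assumed to live under a `status` key, and the tuning values (`safety_factor`, `min_interval`, `max_interval`, `default_estimate`) are illustrative only.

```python
import time
from typing import Any, Callable, Dict


def wait_for_job(
    get_status: Callable[[str], Dict[str, Any]],
    job_id: str,
    *,
    default_estimate: float = 60.0,
    safety_factor: float = 1.5,
    min_interval: float = 5.0,
    max_interval: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Dict[str, Any]:
    """Poll until the job completes, fails, or the estimate-based deadline passes."""
    start = clock()
    deadline = None
    while True:
        job = get_status(job_id)
        state = job.get("status")  # assumed key name for the job state
        if state == "completed":
            return job
        if state == "failed":
            raise RuntimeError(f"job {job_id} failed")

        # Fall back to a default when estimatedTime is missing or zero.
        estimate = float(job.get("estimatedTime") or default_estimate)
        if deadline is None:
            # Wait somewhat longer than the estimate before giving up.
            deadline = start + estimate * safety_factor
        if clock() >= deadline:
            raise TimeoutError(f"job {job_id} not finished before the deadline")

        # Poll about ten times per estimate, clamped to sensible bounds,
        # and never sleep past the deadline.
        interval = min(max(estimate / 10.0, min_interval), max_interval)
        sleep(min(interval, max(deadline - clock(), 0.0)))
```

With an `estimatedTime` of 891 seconds, this sketch polls every 60 seconds and gives up after roughly 22 minutes. The `sleep` and `clock` parameters exist so the loop can be exercised in tests without real waiting.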