一句话摘要
Azure 发布说话人分离 ASR 模型 gpt-4o-transcribe-diarize,支持 100+ 语言实时转写并标注说话人。
详细描述
gpt-4o-transcribe-diarize speech-to-text model released, converting spoken language to text in real time with speaker diarization (who spoke when). Supports 100+ languages with ultra-low latency.
原文摘录
The gpt-4o-transcribe-diarize speech to text model is released. Diarization is the process of identifying who spoke when in an audio stream. It transforms conversations into speaker-attributed transcripts, enabling businesses to extract actionable insights from meetings, customer calls, and live events.