# Transcribe multi-lingual videos

> Build an agentic product that automatically transcribes multi-lingual videos and translates the transcripts to English.

To build a scalable video transcription product, you need to handle large file uploads, manage socket connections for real-time progress, integrate speech-to-text and language models, and work within provider rate limits. As you scale to many users, infrastructure becomes the hardest part: you must orchestrate jobs, scale horizontally, balance load across model providers, pool connections, and handle failures gracefully. Most teams spend more time building infrastructure than focusing on their product.

This guide walks through building a highly scalable agentic product that automatically transcribes videos and translates transcripts to English.

```text
+----------------------+       +----------------------+       +----------------------+
|      Video File      |------>|    Extract Audio     |------>|      Transcribe      |
|     (.mp4, .mov)     |       |       (ffmpeg)       |       |     and Diarize      |
+----------------------+       +----------------------+       +-----------+----------+
                                                                          |
            +-------------------------------------------------------------+
            |
            v
+----------------------+       +----------------------+       +----------------------+
|    Match Speakers    |       | Translate to English |       |       Results        |
|    Across Chunks     |------>|       (Agent)        |------>|    (transcript +     |
|       (Agent)        |       |                      |       |     translation)     |
+----------------------+       +----------------------+       +----------------------+
```

The complete source code is available in [Autonomy examples](https://github.com/build-trust/autonomy/tree/main/examples/voice/video-translator).
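
As a rough illustration of the "Extract Audio (ffmpeg)" stage in the diagram, the sketch below builds and runs an ffmpeg command that pulls the audio track out of a video file. It is not taken from the linked example: the function names `build_ffmpeg_command` and `extract_audio`, the WAV output format, and the mono 16 kHz settings are all assumptions chosen for illustration, and it presumes the `ffmpeg` binary is installed and on `PATH`.

```python
from __future__ import annotations

import subprocess
from pathlib import Path


def build_ffmpeg_command(video_path: str, audio_path: str, sample_rate: int = 16000) -> list[str]:
    """Build an ffmpeg argument list that extracts a mono audio track.

    Hypothetical helper: the mono/16 kHz defaults are assumptions,
    not settings confirmed by the guide.
    """
    return [
        "ffmpeg",
        "-y",                      # overwrite the output file if it already exists
        "-i", video_path,          # input video (.mp4, .mov, ...)
        "-vn",                     # discard the video stream, keep audio only
        "-ac", "1",                # downmix to a single (mono) channel
        "-ar", str(sample_rate),   # resample the audio to the given rate in Hz
        audio_path,
    ]


def extract_audio(video_path: str, audio_dir: str = ".") -> Path:
    """Run ffmpeg and return the path of the extracted .wav file."""
    audio_path = Path(audio_dir) / (Path(video_path).stem + ".wav")
    # check=True raises CalledProcessError if ffmpeg exits with a non-zero status.
    subprocess.run(
        build_ffmpeg_command(video_path, str(audio_path)),
        check=True,
        capture_output=True,
    )
    return audio_path
```

Keeping command construction separate from execution makes the ffmpeg arguments easy to inspect or unit test without the binary present. A call such as `extract_audio("talk.mp4", "/tmp")` would then hand the resulting audio file to the transcription stage.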