2026-06-07
How to Extract Audio from Any Video File
Whether it's a lecture recording, a music video, or a screen capture, extracting the audio track is a one-step operation — here's everything you need to know.
Why Extract Audio?
Common reasons to separate audio from video:
- Lecture recordings: a 2 GB lecture video becomes a 20 MB MP3 you can listen to on a commute
- Music videos: keep the audio, drop the video
- Podcasts recorded as video: many are recorded with cameras but distributed as audio
- Screen recordings: extract narration from a tutorial video
- Game clips: save the soundtrack from a captured gameplay session
Supported Input Formats
Our converter handles the most common video containers:
| Format | Common source |
|---|---|
| MP4 | iPhone, Android, YouTube downloads, screen recorders |
| WebM | Chrome/Firefox screen recordings, YouTube (alternate format) |
| M4A | Apple devices (technically audio-only, but uses the MP4 container) |
What Happens During Extraction
FFmpeg performs stream demuxing: it opens the container, identifies the audio track, and transcodes it to MP3. The video track is discarded entirely — it never needs to be decoded, which is why conversion is fast even for large files.
For videos with AAC audio (most MP4s from Apple devices), FFmpeg re-encodes AAC → MP3. For videos that already contain MP3 audio, it can copy the stream directly without re-encoding.
Quality Tips
Record in the highest quality available. Audio quality is set at recording time; you cannot recover quality lost at the source.
Use 192 kbps for music, 96 kbps for speech. There's no benefit to 320 kbps if the source was compressed video.
Check for multiple audio tracks. Some videos (especially rips of foreign-language content) have multiple audio tracks. Our converter uses the first track by default.
File Size Expectations
A 1 GB, 1-hour MP4 lecture typically produces a 50–80 MB MP3 at 192 kbps. At 96 kbps (still fine for speech), that's 25–40 MB.