How to Convert Video to Audio: Extract Sound from Any Video

Every day, millions of people need to get the audio out of a video file. A student wants the audio from a recorded lecture so they can listen during their commute. A podcaster filmed their interview on video and needs to publish the audio separately. A musician wants to extract the soundtrack from a live performance video. A content creator needs to separate dialogue from background music for a remix.

The good news is that extracting audio from video is one of the fastest and simplest conversion tasks — when done correctly. The keyword is "correctly." The wrong approach re-encodes audio unnecessarily, adds generation loss, wastes time, and produces inferior results. The right approach identifies the audio codec already inside the video, extracts it directly when possible, and only re-encodes when the target format requires a different codec.

This tutorial covers every aspect of video-to-audio conversion: how audio is stored inside video files, when to extract without re-encoding versus when transcoding is necessary, how to choose the right output format, batch processing workflows, and metadata preservation. For an even deeper dive into audio extraction specifically, see our comprehensive extraction guide.

Video to audio conversion workflow showing format options

Understanding Audio Inside Video Files

A video file is a container that holds multiple streams: video, audio, and sometimes subtitles and metadata. The container format (MP4, MKV, MOV, WebM, AVI) determines what codecs can be stored inside, but the actual audio encoding is independent of the container.

Here is what you typically find inside common video formats:

Video Container	Typical Audio Codec	Audio Format Equivalent
MP4	AAC	M4A / AAC
MOV	AAC or PCM	M4A or WAV
MKV	AAC, AC3, DTS, FLAC, Opus	Various
WebM	Vorbis or Opus	OGG or Opus
AVI	MP3 or PCM	MP3 or WAV
FLV	AAC or MP3	AAC or MP3

This matters because if the audio inside your MP4 is already AAC-encoded and you want an AAC file, you can copy the audio stream directly — no quality loss, no processing time, bit-for-bit identical to what was in the video. Re-encoding is only necessary when your target format uses a different codec than what is stored in the video.

Check What Audio Is Inside Your Video

Before extracting, identify the audio codec:

ffprobe -v quiet -print_format json -show_streams input.mp4 | grep codec_name

Or for more detail:

ffprobe -v quiet -show_entries stream=codec_name,codec_type,bit_rate,sample_rate,channels -of compact input.mp4

This tells you the audio codec (aac, mp3, opus, flac, pcm_s16le, etc.), bitrate, sample rate, and channel count — everything you need to decide whether to extract directly or re-encode.

Use Case	Best Format	Bitrate	Why
General listening	MP3 320 kbps	320 kbps	Universal compatibility
Podcast distribution	MP3 128 kbps mono	128 kbps	Industry standard for speech
Music archival	FLAC	Lossless	Preserves every detail
Audio editing	WAV	Lossless	Universal editor support
iPhone/iPad playback	AAC (M4A)	256 kbps	Native Apple format
Streaming / web	Opus	128 kbps	Best quality-per-bit
Ringtone creation	MP3 or M4R	192 kbps	Phone compatibility
Transcription service	WAV 16-bit mono	Lossless	ASR engine standard input

Source Audio	Target	Quality Result	Recommendation
AAC 256k	MP3 320k	Slight quality loss	Extract AAC directly instead
AAC 128k	MP3 320k	No improvement	Use MP3 192k or extract AAC
PCM (lossless)	MP3 320k	Expected lossy quality	Good — encoding from lossless source
PCM (lossless)	FLAC	Perfect preservation	Best for archival
FLAC	MP3 320k	Expected lossy quality	Good — encoding from lossless source
MP3 128k	FLAC	No improvement	Waste of space — keeps MP3 quality

How to Convert Video to Audio: Extract Sound from Any Video

Understanding Audio Inside Video Files

Check What Audio Is Inside Your Video

Try these conversions

Related Articles

Audacity Export Settings: MP3, FLAC, WAV, and the Hidden LAME Trap

How to Extract Audio from Video: MP4 to MP3 and Beyond

How to Transcribe Video Content: Extract Audio and Convert for Text

Choosing the Right Output Format

Extracting Audio Without Re-Encoding

Extract AAC from MP4

Extract MP3 from AVI

Extract Opus from WebM

Extract FLAC from MKV

When Stream Copy Fails

Re-Encoding Audio to a Different Format

Convert to MP3

Convert to AAC

Convert to FLAC (Lossless)

Convert to WAV

Convert to Opus

Extracting Audio from Specific Time Ranges

Handling Multi-Track Audio

List All Audio Tracks

Extract a Specific Track

Downmix Surround Sound to Stereo

Batch Audio Extraction

Bash Script for Batch Extraction

Batch Extract Without Re-Encoding

Using ConvertIntoMP4 for Batch Processing

Preserving Metadata

Extracting Album Art

Adding Album Art to Extracted Audio

Quality Considerations

Re-Encoding Between Lossy Formats

Upsampling Provides No Benefit

Sample Rate Considerations

Platform-Specific Extraction

Extract Audio for Podcast Distribution

Extract Audio for Music Production

Extract Audio for Transcription

Using ConvertIntoMP4 for Audio Extraction

About the Author