Why Your YouTube Audio Sounds Wrong
You spent hours recording, editing, and mixing your video. The audio sounds perfect on your computer. You upload it to YouTube, and suddenly the vocals sound thin, the music feels compressed, the bass is muddy, and the overall volume is noticeably different from other videos on the platform. What happened?
YouTube re-encodes every video you upload — including the audio track. Regardless of what format you upload, YouTube transcodes your audio to AAC (in MP4 containers) for standard playback and Opus (in WebM containers) for higher-quality tiers. This re-encoding is unavoidable, but how much quality you lose depends heavily on the format, bitrate, and loudness of the audio you upload in the first place.
This guide covers everything you need to know about preparing audio for YouTube: the recommended formats and settings, loudness normalization to YouTube's -14 LUFS target, common mistakes that degrade quality, and the best workflow for music channels, podcasts, tutorials, and other content types.

YouTube's Audio Processing Pipeline
Understanding what YouTube does to your audio after upload helps you make better decisions about what to upload.
What Happens After You Upload
- Ingest: YouTube accepts your uploaded file (MP4, MOV, MKV, AVI, etc.)
- Demux: The audio and video tracks are separated
- Loudness analysis: YouTube measures your audio's integrated loudness in LUFS
- Loudness normalization: If your audio exceeds -14 LUFS, YouTube turns it down (but never up)
- Transcode: YouTube encodes the audio into multiple quality tiers
- Delivery: Viewers receive the appropriate tier based on their playback quality setting
YouTube's Audio Output Tiers
| Quality Tier | Codec | Container | Bitrate | Channels |
|---|---|---|---|---|
| Low (144p-240p) | AAC-LC | MP4 | ~48 kbps | Mono or Stereo |
| Medium (360p-480p) | AAC-LC | MP4 | ~128 kbps | Stereo |
| High (720p-1080p) | AAC-LC | MP4 | ~192 kbps | Stereo |
| Premium (Opus) | Opus | WebM | ~160 kbps | Stereo |
| Surround (5.1) | AAC-LC or Opus | MP4/WebM | ~384 kbps | 5.1 Surround |
The key insight here is that YouTube's highest standard audio output is approximately 192 kbps AAC. Uploading audio at a bitrate higher than this does not result in higher-quality playback for your viewers — it only gives YouTube a better source to transcode from, which can slightly improve the quality of the output.
Recommended Upload Settings
YouTube's Official Recommendations
YouTube recommends uploading audio in AAC-LC format at 384 kbps or higher inside an MP4 container. Here is the full recommended spec:
| Parameter | Recommended Setting | Notes |
|---|---|---|
| Audio Codec | AAC-LC | Preferred; MP3 and PCM also accepted |
| Sample Rate | 48 kHz | Match your video's native rate |
| Bitrate | 384 kbps+ (stereo) / 512 kbps+ (5.1) | Higher gives YouTube better source material |
| Channels | Stereo or 5.1 Surround | Mono is accepted but not recommended |
| Loudness | -14 LUFS integrated | YouTube normalizes louder audio down |
| True Peak | -1 dBTP | Prevents clipping after transcoding |
Practical Encoding Command
Here is how to prepare your audio for YouTube using FFmpeg:
# Encode audio track as AAC 384 kbps at 48 kHz
ffmpeg -i input.mov -c:v copy -c:a aac -b:a 384k -ar 48000 output.mp4
# If your source audio is 44.1 kHz, resample to 48 kHz
ffmpeg -i input.mp4 -c:v copy -c:a aac -b:a 384k -ar 48000 -af aresample=resampler=soxr output_48k.mp4
If your video already has the right video codec (H.264 or H.265) and you only need to adjust the audio, -c:v copy passes the video through without re-encoding, which is much faster and preserves video quality perfectly.
Pro Tip: Always use 48 kHz sample rate for YouTube, not 44.1 kHz. Video production standards use 48 kHz, and YouTube's processing pipeline is optimized for it. If your audio is at 44.1 kHz (common for music), resample to 48 kHz before uploading to avoid unnecessary sample rate conversion artifacts during YouTube's processing. Our sample rate guide explains why this matters.
Loudness Normalization for YouTube
Why Loudness Matters
YouTube normalizes audio loudness to ensure a consistent listening experience across videos. If your video's integrated loudness exceeds -14 LUFS, YouTube will reduce the volume. If it is below -14 LUFS, YouTube leaves it alone (it does not boost quiet audio).
This means:
- If you upload audio mastered at -8 LUFS (loud, modern pop mastering), YouTube turns it down by 6 dB.
- If you upload audio at -14 LUFS, YouTube does not adjust it.
- If you upload audio at -20 LUFS, YouTube leaves it quiet. Your video will sound noticeably softer than other content.
The practical recommendation is to target -14 LUFS with a true peak of -1 dBTP. This matches YouTube's target, avoids any loudness reduction, and provides sufficient headroom to prevent clipping during transcoding.
Normalizing with FFmpeg
# Two-pass loudnorm (most accurate)
# Pass 1: Analyze
ffmpeg -i input.mp4 -af loudnorm=I=-14:TP=-1:LRA=11:print_format=json -f null /dev/null
# Pass 2: Apply (use measured values from pass 1)
ffmpeg -i input.mp4 -c:v copy -af loudnorm=I=-14:TP=-1:LRA=11:measured_I=-18.5:measured_LRA=8.2:measured_TP=-2.1:measured_thresh=-28.9 -c:a aac -b:a 384k output.mp4
The two-pass approach gives more accurate results than single-pass because the encoder knows the full file's loudness characteristics before processing. For a complete walkthrough of audio normalization techniques, see our dedicated guide on how to normalize audio volume.
Pro Tip: You can check YouTube's loudness normalization status on any video by right-clicking the player and selecting "Stats for nerds." The "Volume / Normalized" line shows the content loudness and the normalization factor YouTube applied. Use this to verify your normalization is working correctly.
Best Settings by Content Type
Music Videos and Music Channels
Music demands the highest audio quality. Your listeners may be using headphones or quality speakers, and they are actively paying attention to the audio.
- Codec: AAC-LC at 384+ kbps (or PCM/WAV for maximum source quality)
- Sample Rate: 48 kHz
- Loudness: -14 LUFS (resist the temptation to master louder — YouTube will just turn it down)
- True Peak: -1 dBTP
- Stereo Width: Preserve full stereo field; avoid excessive widening that sounds unnatural on earbuds
If you are uploading a music track over a static image or simple visualizer, consider uploading at 1080p or higher even if the visual is simple. YouTube assigns higher audio bitrates to higher-resolution videos, so a 4K upload gets better audio treatment than a 480p upload with identical audio.
For preparing your audio master for YouTube, our audio converter can help with format conversion, and our guide on audio bitrate and quality covers the encoding decisions in depth.
Podcasts and Talking Head Videos
Voice content is more forgiving of compression than music. The frequency range is narrower, there are fewer simultaneous sounds, and listeners are typically focused on comprehension rather than sonic fidelity.
- Codec: AAC-LC at 192+ kbps
- Sample Rate: 48 kHz
- Loudness: -14 LUFS (critical for podcast consistency)
- Mono vs. Stereo: Stereo is recommended even for single-speaker content. YouTube handles stereo more gracefully than mono in some playback scenarios.
- Noise reduction: Apply before export. Background noise becomes more audible after YouTube's compression.
See our best audio format for podcasts guide for format recommendations beyond YouTube.
Gaming and Screen Recording
Gaming content typically combines game audio, voice commentary, and sometimes music. The mixed audio can be challenging for codecs.
- Codec: AAC-LC at 256+ kbps
- Sample Rate: 48 kHz
- Loudness: -14 LUFS (use a compressor/limiter on voice to keep it above game audio)
- Tip: Mix voice and game audio separately before export. OBS Studio and most recording tools support separate audio tracks. Mix the voice 3-6 dB above the game audio for intelligibility.
Tutorials and Educational Content
Clear voice is the priority. Background music should be subtle and not compete with narration.
- Codec: AAC-LC at 192+ kbps
- Sample Rate: 48 kHz
- Background music: Mix at -18 to -24 dB relative to voice
- Loudness: -14 LUFS

Common Audio Mistakes on YouTube
Mistake 1: Uploading MP3 Audio in a Video Container
If you export your video with an MP3 audio track (common with some older editing software), YouTube re-encodes that already-lossy MP3 into AAC. This is lossy-to-lossy transcoding, which compounds artifacts. Always export with AAC or uncompressed audio.
Mistake 2: Mastering Too Loud
If your audio is mastered at -6 LUFS (common for commercial music releases), YouTube reduces it by 8 dB. This makes your dynamics sound artificially compressed because the original mastering crushed the dynamic range to achieve that loudness, and now YouTube has turned it all down, exposing the compression artifacts without the perceived loudness benefit.
Mistake 3: Wrong Sample Rate
Uploading 44.1 kHz audio forces YouTube to resample to 48 kHz. While modern resamplers are excellent, this is an unnecessary processing step that can introduce subtle artifacts, especially on high-frequency content. Record and export at 48 kHz from the start.
Mistake 4: Forgetting About Mobile Listeners
Over 70% of YouTube viewing happens on mobile devices, many using the phone's built-in speaker or cheap earbuds. Audio that sounds great on studio monitors may have problems on these devices — particularly with low-end frequencies that small speakers cannot reproduce. Always check your mix on phone speakers and earbuds before uploading.
Mistake 5: Not Leaving Headroom for Transcoding
YouTube's AAC transcoding can introduce inter-sample peaks that exceed 0 dBFS even if your original file does not clip. Leaving -1 dBTP of headroom prevents this. If your original peaks at -0.1 dBFS, the transcoded version may clip. Use a limiter with a ceiling of -1 dBTP.
Preparing Audio-Only Content for YouTube
If you want to upload a podcast episode, audiobook chapter, or music track to YouTube without video, you need to pair it with a video track. YouTube does not accept audio-only files.
Simple Static Image Method
# Create video from static image + audio file
ffmpeg -loop 1 -i cover.jpg -i audio.flac -c:v libx264 -tune stillimage -c:a aac -b:a 384k -ar 48000 -shortest -pix_fmt yuv420p output.mp4
Audio Visualizer Method
For music channels, an audio visualizer adds visual interest. Tools like After Effects, HitFilm, and open-source alternatives can generate waveform or spectrum visualizations synced to your audio.
If you need to convert your audio file to a YouTube-compatible format first, our MP3 converter, AAC converter, or WAV converter can handle the preparation. For extracting audio from existing videos to re-edit and re-upload, see our guide on how to extract audio from video.
YouTube Shorts Audio
YouTube Shorts have slightly different audio considerations:
- Maximum length is 3 minutes (extended from the original 60 seconds)
- Audio is looped if the video is short enough to auto-replay
- Most viewers watch with sound on (unlike some other short-form platforms)
- The audio track should sound good even when cut at the loop point
For Shorts, the same audio settings apply (AAC, 48 kHz, -14 LUFS), but pay extra attention to the beginning and end of your audio to ensure smooth looping if your Short is under 60 seconds.
5.1 Surround Sound on YouTube
YouTube supports 5.1 surround sound audio. To upload surround content:
# Encode 5.1 surround for YouTube
ffmpeg -i input_51.wav -c:a aac -b:a 512k -ar 48000 -ac 6 -channel_layout 5.1 output.mp4
The surround encoding must use the correct channel layout: Front Left, Front Right, Center, LFE (subwoofer), Surround Left, Surround Right. Listeners with compatible setups will hear the surround mix; others receive a stereo downmix.

Troubleshooting Audio Issues
Audio Sounds Compressed or Thin After Upload
This usually means your source audio bitrate was too low or you uploaded in a lossy format that YouTube re-encoded. Solution: Export audio as AAC at 384+ kbps or uncompressed WAV/FLAC before muxing into your video file.
Audio Volume Is Lower Than Other YouTube Videos
Your integrated loudness is below -14 LUFS. Normalize your audio to -14 LUFS before uploading. YouTube does not boost quiet audio, so videos below the target sound soft compared to properly normalized content.
Audio and Video Are Out of Sync
This is usually caused by variable frame rate (VFR) recording, common with screen recording tools and phone cameras. Convert to constant frame rate (CFR) before uploading:
ffmpeg -i input_vfr.mp4 -c:v libx264 -c:a aac -b:a 384k -vsync cfr output_cfr.mp4
Audio Sounds Different at Different Quality Settings
YouTube encodes different quality tiers at different bitrates. At 360p, audio is only ~128 kbps AAC. At 1080p, it is ~192 kbps. If your content relies on high audio quality, encourage viewers to watch at higher quality settings, or mention it in your video description.
Checklist Before Uploading
Use this checklist for every YouTube upload to ensure optimal audio quality:
- Audio codec is AAC-LC (not MP3) at 384 kbps or higher
- Sample rate is 48 kHz
- Integrated loudness is at or near -14 LUFS
- True peak does not exceed -1 dBTP
- No clipping or distortion in the audio
- Video resolution is 1080p or higher (for better audio bitrate allocation)
- Audio and video are in sync throughout the entire duration
- Tested playback on phone speakers and earbuds
- Background noise is minimal (apply noise reduction if needed)
- Stereo audio (even for mono sources, exported as dual-mono)
For the best starting point, convert your audio to the right format using our audio converter, check your format choices against our AAC vs MP3 comparison, and read our video upload guide for YouTube for the complete video+audio optimization workflow.



