// the find
leandromoreira/digital_video_introduction
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸
A thorough written guide to digital video fundamentals — pixels, color models, chroma subsampling, frame types, DCT, quantization, entropy coding, and streaming — aimed at software engineers with no prior video background. Comes with Jupyter notebooks for hands-on experiments and Docker-based ffmpeg examples. Think of it as the missing textbook chapter before you touch a real video pipeline.
- Unusually good visual explanations: the diagrams for YCbCr subsampling, motion vectors, DCT coefficients, and H.264 bitstream structure are genuinely clearer than most paid textbooks
- Hands-on exercises actually work — Docker containerizes ffmpeg/mediainfo so the commands run identically regardless of host OS, which removes a major friction point for learners
- Covers the full stack from raw pixel math through codec internals (H.264, HEVC, AV1, VP9) to HLS/DASH adaptive streaming and DRM in a single coherent narrative
- Available in 8 languages with community-maintained translations, which is rare for this kind of technical material
- Last meaningful update was 2023 and the AV1 section feels thin compared to the H.264 coverage — AV1 is no longer new and deserves the same depth
- The Intel Video Pro Analyzer is repeatedly recommended as a visualization tool but it's commercial software; the free trial caps at 10 frames, making several hands-on sections essentially dead ends for most readers
- No coverage of modern encoding toolchains like SVT-AV1, rav1e, or cloud encoding workflows — the ffmpeg examples are fine for learning but leave a gap before production use
- Jupyter notebooks have no dependency pinning (no requirements.txt or lock file), so notebook exercises break silently as numpy/matplotlib APIs drift