Coding of Speech, Audio and Video

Prof. Dr. S. Feldes

Contents

70% lectures, 20% exercises, 10% laboratory teamwork

Introduction – Basic concepts, Redundancy & Irrelevancy, Quality Assessment, Stochastic Fundamentals

Quantization – Uniform Quantization, Companding (A-law, μ-law), Optimal Quantizer (Lloyd-Max), Forward- / Backward-Adaptive Quantization, Vector Quantization, Codebook Design with the Linde-Buzo-Gray (LBG) Algorithm

Prediction – Principle of Predictive Coding, Prediction Gain, Adaptive Prediction, Block & Sequential Adaptation, Open-Loop / Closed-Loop, Noise Shaping, ADPCM

Lossless Coding – Entropy, Code Rate, Decodability, Discrete Sources (memoryless and with memory), Shannon's Source Coding Theorem, Huffman Coding, Arithmetic Coding, Lempel-Ziv-Welch, Run-Length Coding

Speech Coding – Human Articulation Process, Source-Filter Model; Vocoder, Linear Predictive Coding, Long-Term Prediction, Excitation Modelling, Analysis-by-Synthesis, CELP; Variable-Bitrate & Embedded Coding; Standards: UMTS, ITU-T G.7xx

Audio Coding – Human Ear & Hearing; Psychoacoustic Model, Spectral & Temporal Masking; Subband Coding, Transform Coding, MDCT, QMF Filter Banks; Spectral Band Replication; Standards: MP3, AAC, HE-AAC

Image & Video Coding – Human Eye & Visual Perception; 2D-DCT, Wavelet Transform, Hybrid Coding, Motion-Compensated Prediction, Motion Estimation, Block Matching; Standards: JPEG, JPEG 2000, MPEG-1, MPEG-2, MPEG-4, H.264/AVC

Full version (in German)