logo-polimi
Loading...
Risorse bibliografiche
Risorsa bibliografica obbligatoria
Risorsa bibliografica facoltativa
Scheda Riassuntiva
Anno Accademico 2019/2020
Scuola Scuola di Ingegneria Industriale e dell'Informazione
Insegnamento 088966 - MULTIMEDIA SIGNAL PROCESSING
  • 089837 - MULTIMEDIA SIGNAL PROCESSING 2ND MODULE
Docente Bestagini Paolo
Cfu 5.00 Tipo insegnamento Modulo Di Corso Strutturato

Corso di Studi Codice Piano di Studio preventivamente approvato Da (compreso) A (escluso) Insegnamento
Ing Ind - Inf (Mag.)(ord. 270) - CR (263) MUSIC AND ACOUSTIC ENGINEERING*AZZZZ088966 - MULTIMEDIA SIGNAL PROCESSING
086059 - MULTIMEDIA SIGNAL PROCESSING
Ing Ind - Inf (Mag.)(ord. 270) - MI (474) TELECOMMUNICATION ENGINEERING - INGEGNERIA DELLE TELECOMUNICAZIONI*AZZZZ088966 - MULTIMEDIA SIGNAL PROCESSING
Ing Ind - Inf (Mag.)(ord. 270) - MI (481) COMPUTER SCIENCE AND ENGINEERING - INGEGNERIA INFORMATICA*AZZZZ088966 - MULTIMEDIA SIGNAL PROCESSING

Obiettivi dell'insegnamento

The goal of the course is to provide students with advanced knowledge and skills on multimedia signal processing applications, with a specific focus on coding of multimedia objects. This course covers relevant applications of digital signal processing with specific reference to multimedia communication, e.g. audio, image and video coding. Furthermore, the course gives an insight on widely-adopted international coding standards such as MPEG Audio, JPEG and MPEG Video, among others.


Risultati di apprendimento attesi

Dublin Descriptors

Expected learning outcomes

Knowledge and understanding

Students will learn how to:

·       Apply advanced processing techniques to 1D and 2D digital data

·       Encode and decode multimedia objects with well-known coding schemes

·       Evaluate distortion introduced on multimedia objects by typical processing techniques

Applying knowledge and understanding

Given specific project cases, students will be able to:

·       Develop an end-to-end encoding and decoding architecture tailored to a specific media type

·       Implement advanced 1D and 2D signal processing techniques

Lifelong learning skills

·       Students will learn how to design and develop a realistic project related to multimedia digital data processing.

·       Students will learn how to analyze the performance of multimedia coding schemes.


Argomenti trattati

  • Source coding:
    • Discrete memoryless sources, discrete sources with memory, entropy of a source, uniquely decodable and prefix codes, Shannon’s source coding theorem, Huffman coding, arithmetic coding, run length coding.
  • Quantization:
    • Uniform scalar quantization, Lloyd-Max scalar quantization, entropy constrained scalar quantization, rate-distortion theory, vector quantization.
  • Predictive coding:
    • Linear predictive coding, DPCM, delta modulation, predictive coding gain.
  • Transform coding:
    • Linear transforms, unitary transforms, linear approximation, non-linear approximation, KLT, DCT, transform coding gain, bit allocation, sub-band coding, wavelet transform, 2D transforms.
  • A review of Waveform Coding of audio signals:
    • PCM, DPCM, Delta Modulation, ADPCM. Lossless compression techniques.
  • Speech coding:
    • Vocal tract modeling, LPC, pitch extraction, voiced/unvoiced detection, analysis by synthesis.
  • Audio coding:
    • Fundamentals of psychoacoustics, frequency masking, temporal masking, filter banks (PQMF, MDCT), bit allocation and entropy coding. Coding standards: MPEG-Audio, Advanced Audio Coding (AAC), AC3.
  • Image coding:
    • Human visual system, visual redundancy and irrelevancy, lossless and lossy image coding, transform coding and quantization. Coding standards: JPEG.
  • Video coding:
    • DPCM, motion estimation, coding of prediction residuals, coding of motion vectors, rate-distortion optimization. Coding standards: MPEG-x, H.264/AVC.
  • Other multimedia applications:
    • Multimodal analysis, image intensity pointwise operations, color handling, histogram equalization, morphological processing, feature analysis, matching, registration, forensics.

Prerequisiti

Students are required to know the basic principles of signal processing and to have basic programming skills.


Modalità di valutazione

The assessment will be based on a written exam at the end of the course. The written exam consists of two parts. The first part contains numerical exercises and theoretical questions. This will assign up to 23 points. The second part consists in a programming test. This will assign up to 10 points. 30 cum laude will be assigned when the total score is higher than 31.

Type of assessment

Description

Dublin descriptor

Written test

Solution of numerical problems

·       Source entropy computation

·       Computing distortion introduced by lossy coding techniques

·       Multimedia signals filtering

Exercises focusing on design aspects

·       Development of lossless and lossy coding schemes

·       Parameters selection for multimedia processing algorithms

Theoretical questions on all course topics with open answer

·       Information theory

·       Digital signal processing applied to multimedia objects

·       Coding standards

1, 2

 

 

 

1, 2, 5

 

 

 

1, 5


Bibliografia
Risorsa bibliografica facoltativaM. Bosi, Introduction to Digital Audio Coding and Standards, Anno edizione: 2002
Risorsa bibliografica facoltativaT. Spanias and T. Painter, Audio signal processing and coding, Anno edizione: 2006
Risorsa bibliografica facoltativaR. C. Gonzalez and R. E. Woods, Digital Image Processing, Anno edizione: 2007
Risorsa bibliografica facoltativaD. Taubman and M. Marcellin, Image Compression Fundamentals, Standards and Practice, Anno edizione: 2001
Risorsa bibliografica facoltativaM. Tekalp, Digital Video Processing, Anno edizione: 1995
Risorsa bibliografica facoltativaI. Richardson, Video Codec Design: Developing Image and Video Compression Systems, Anno edizione: 2002
Risorsa bibliografica facoltativaI. Richardson, Video Codec Design: H.264 and MPEG-4 Video Compression, Anno edizione: 2003

Software utilizzato
Nessun software richiesto

Forme didattiche
Tipo Forma Didattica Ore di attività svolte in aula
(hh:mm)
Ore di studio autonome
(hh:mm)
Lezione
30:00
45:00
Esercitazione
20:00
30:00
Laboratorio Informatico
0:00
0:00
Laboratorio Sperimentale
0:00
0:00
Laboratorio Di Progetto
0:00
0:00
Totale 50:00 75:00

Informazioni in lingua inglese a supporto dell'internazionalizzazione
Insegnamento erogato in lingua Inglese
Disponibilità di materiale didattico/slides in lingua inglese
Disponibilità di libri di testo/bibliografia in lingua inglese
Possibilità di sostenere l'esame in lingua inglese
Disponibilità di supporto didattico in lingua inglese
schedaincarico v. 1.8.3 / 1.8.3
Area Servizi ICT
09/12/2023