logo-polimi
Loading...
Risorse bibliografiche
Risorsa bibliografica obbligatoria
Risorsa bibliografica facoltativa
Scheda Riassuntiva
Anno Accademico 2019/2020
Scuola Scuola di Ingegneria Industriale e dell'Informazione
Insegnamento 088966 - MULTIMEDIA SIGNAL PROCESSING
Docente Bestagini Paolo , Marcon Marco
Cfu 10.00 Tipo insegnamento Corso Integrato

Corso di Studi Codice Piano di Studio preventivamente approvato Da (compreso) A (escluso) Insegnamento
Ing Ind - Inf (Mag.)(ord. 270) - CR (263) MUSIC AND ACOUSTIC ENGINEERING*AZZZZ088966 - MULTIMEDIA SIGNAL PROCESSING
086059 - MULTIMEDIA SIGNAL PROCESSING
Ing Ind - Inf (Mag.)(ord. 270) - MI (474) TELECOMMUNICATION ENGINEERING - INGEGNERIA DELLE TELECOMUNICAZIONI*AZZZZ088966 - MULTIMEDIA SIGNAL PROCESSING
Ing Ind - Inf (Mag.)(ord. 270) - MI (481) COMPUTER SCIENCE AND ENGINEERING - INGEGNERIA INFORMATICA*AZZZZ099325 - FUNDAMENTALS OF MULTIMEDIA SIGNAL PROCESSING
088966 - MULTIMEDIA SIGNAL PROCESSING

Obiettivi dell'insegnamento

The course is split into two modules.

The first module introduces the fundamental tools for digital signal processing. It mainly focuses on signal analysis and filtering concerning audio and video sources but the theoretical part allows the student to easily extend his/her knowledge to other digital signal processing fields. The course aim at providing the participants with basic tools for digital signal filtering and analysis, and with the knowledge to design specific processing tools for 1D and 2D discrete signals.

The goal of the second module is to provide students with advanced knowledge and skills on multimedia signal processing applications, with a specific focus on coding of multimedia objects. This course covers relevant applications of digital signal processing with specific reference to multimedia communication, e.g. audio, image and video coding. Furthermore, the course gives an insight on widely-adopted international coding standards such as MPEG Audio, JPEG and MPEG Video, among others.

Both modules focus on theoretical and practical aspects with laboratory classrooms in MATLAB where students can directly apply and verify learnt notions.


Risultati di apprendimento attesi

Dublin Descriptors

Expected learning outcomes

Knowledge and understanding

Students will learn how to:

·       Move from the continuous to the discrete signals representation.

·       Deal with common discrete signal transforms (e.g. DTFT, DFT, Z transf., STFT) in 1D and 2D spaces.

·       Change sample rate of a discrete signal.

·       Deal with real-time digital signal processing and windowing.

·       Adopt multi-rate signal processing and design perfect reconstruction filter banks.

·       Apply advanced processing techniques to 1D and 2D digital data.

·       Encode and decode multimedia objects with well-known coding schemes.

·       Evaluate distortion introduced on multimedia objects by typical processing techniques.

Applying knowledge and understanding

Given specific project cases, students will be able to:

·       Design the proper filter for a specific application. Examples provided during the course will mainly focus on audio and video signals but the acquired theoretical background will allow the student to deal with digital filter design in other application fields.

·       Tune the system design in order to fulfill the specifications constraints in terms, e.g., of maximum delay, phase distortion or maximum filter length.

·       Develop an end-to-end encoding and decoding architecture tailored to a specific media type

·       Implement advanced 1D and 2D signal processing techniques

Lifelong learning skills

·       Students will learn how to design specific digital filters in order to fulfill project requirements and system constraints.

·       Students will learn how to analyze the behavior of digital filters both in time and in the frequency spaces.

·       Students will learn how to design and develop a realistic project related to multimedia digital data processing.

·       Students will learn how to analyze the performance of multimedia coding schemes.


Argomenti trattati

Module I: Fundamentals of digital signal processing

  • Review of Analog Signal Processing theory:
    • Some fundamentals of Analog Signal Processing will be recalled (sampling, quantization, Fourier Transform and Series, noise description...).
  • Introduction to Discrete Signals Transforms:
    • The z-transform and the Discrete Fourier Transform will be described with their properties and application to filter design and analysis.
  • Introduction to Digital Filters:
    • This course part will cover Time-domain and space domain filter representations, transfer function analysis, frequency response analysis, Finite and Infinite impulse response implementation; stability analysis
  • Windowing and Short Time Fourier Transform:
    • Overview of windows and real-time processing; overlap and add, overlap and save,Short Time Fourier Transform for real time processing
  • Introduction to Multirate Processing:
    • Downsampling, upsampling, polyphase filters and perfect reconstruction filter banks will be introduced.
  • Multimedia applications:
    • Practical filter design for audio signals, basic image processing (e.g., edge extraction, histogram processing,rescaling).

 

Module II: Fundamentals of coding

  • Source coding:
    • Discrete memoryless sources, discrete sources with memory, entropy of a source, uniquely decodable and prefix codes, Shannon’s source coding theorem, Huffman coding, arithmetic coding, run length coding.
  • Quantization:
    • Uniform scalar quantization, Lloyd-Max scalar quantization, entropy constrained scalar quantization, rate-distortion theory, vector quantization.
  • Predictive coding:
    • Linear predictive coding, DPCM, delta modulation, predictive coding gain.
  • Transform coding:
    • Linear transforms, unitary transforms, linear approximation, non-linear approximation, KLT, DCT, transform coding gain, bit allocation, sub-band coding, wavelet transform, 2D transforms.
  • A review of Waveform Coding of audio signals:
    • PCM, DPCM, Delta Modulation, ADPCM. Lossless compression techniques.
  • Speech coding:
    • Vocal tract modeling, LPC, pitch extraction, voiced/unvoiced detection, analysis by synthesis.
  • Audio coding:
    • Fundamentals of psychoacoustics, frequency masking, temporal masking, filter banks (PQMF, MDCT), bit allocation and entropy coding. Coding standards: MPEG-Audio, Advanced Audio Coding (AAC), AC3.
  • Image coding:
    • Human visual system, visual redundancy and irrelevancy, lossless and lossy image coding, transform coding and quantization. Coding standards: JPEG.
  • Video coding:
    • DPCM, motion estimation, coding of prediction residuals, coding of motion vectors, rate-distortion optimization. Coding standards: MPEG-x, H.264/AVC.
  • Other multimedia applications:
    • Multimodal analysis, image intensity pointwise operations, color handling, histogram equalization, morphological processing, feature analysis, matching, registration, forensics.

Innovative Didactics

The "Flipped Classroom" methodology will be applied to practical aspects concerning better understanding of Matlab techniques for a proper signal processing.


Prerequisiti

Students are required to know the basic principles of signal processing and to have basic programming skills.


Modalità di valutazione

Module I: Fundamentals of digital signal processing

The assessment will be based on a written exam at the end of the course. The written exam consists of numerical exercises, theoretical questions and programming exercises (in MatLab®); no further oral exams or projects are considered. Further details on the exam procedure are available on the lecturer’s homepage.

Type of assessment

Description

Dublin descriptor

Written test

Solution of numerical problems

·       Filter design from project specifications.

·       Filter analysis and output calculation from a given input in time, space or frequency domains.

·       Procedures for signal trasformation: scaling, upsampling, downsampling

·       Multirate filter design

·       Windowing effect on different signals.

Exercises focusing on design aspects

·       Write Matlab® code to design a filter according to the specifications and process an assigned input.

·       Write Matlab® code to analyze a filter behaviour and process an assigned input.

·       Implement in Simulink ® a system for real time signal processing.

·       Write Matlab® code for signal transformation, features extraction.

·       Write Matlab® code for multirate signal processing, windowing and perfect reconstruction filters.

Theoretical questions on all course topics with open answer

1, 2

 

 

 

 

 

1, 2, 5

 

 

 

 

 

 

 

 

1, 5

 

Module II: Fundamentals of coding

The assessment will be based on a written exam at the end of the course. The written exam consists of two parts. The first part contains numerical exercises and theoretical questions. This will assign up to 23 points. The second part consists in a programming test. This will assign up to 10 points. 30 cum laude will be assigned when the total score is higher than 31.

Type of assessment

Description

Dublin descriptor

Written test

Solution of numerical problems

·       Source entropy computation

·       Computing distortion introduced by lossy coding techniques

·       Multimedia signals filtering

Exercises focusing on design aspects

·       Development of lossless and lossy coding schemes

·       Parameters selection for multimedia processing algorithms

Theoretical questions on all course topics with open answer

·       Information theory

·       Digital signal processing applied to multimedia objects

·       Coding standards

1, 2

 

 

 

1, 2, 5

 

 

 

1, 5

 

Concerning the assessment of the whole exam (Module I and Module II)

In order to pass the whole Multimedia Signal Processing exam, a positive grade must be obtained in both modules and the final mark will be the average of the two marks rounded up the next integer.

In evaluating the average between the two modules a 30 cum laude mark in a module will be considered as 30; in order to get 30 cum laude as a final mark, the mark of at least one module must be 30 cum laude.

In every exam date one module will take place in the morning while the other in the afternoon (the sequence will be published in the web poliself) but the exams of both modules will take place in the same day; anyway you can choose to take on a single module or both modules in the same day. Once you get a positive mark in a module this mark will be automatically frozen until you take again the same module in a following session.

Once you will get a positive mark in both modules the final mark will be evaluated and then published for the publishing period on the web poliself ; at the end of this period it will be automatically recorded. If you do not want to record that final grade you have to refuse it from the poliself: in that case both marks will be restored in a "frozen" state in order to allow you to take on again a module (or both of them). Anyway if you have a frozen mark in both modules at the end of each exam to which you signed in, the final average mark will be automatically published and, after the publishing period, automatically recorded: so, if you want to improve your final mark you have to remember to refuse the published final grade.


Bibliografia

Software utilizzato
Nessun software richiesto

Forme didattiche
Tipo Forma Didattica Ore di attività svolte in aula
(hh:mm)
Ore di studio autonome
(hh:mm)
Lezione
60:00
90:00
Esercitazione
40:00
60:00
Laboratorio Informatico
0:00
0:00
Laboratorio Sperimentale
0:00
0:00
Laboratorio Di Progetto
0:00
0:00
Totale 100:00 150:00

Informazioni in lingua inglese a supporto dell'internazionalizzazione
Insegnamento erogato in lingua Inglese
Disponibilità di materiale didattico/slides in lingua inglese
Disponibilità di libri di testo/bibliografia in lingua inglese
Possibilità di sostenere l'esame in lingua inglese
Disponibilità di supporto didattico in lingua inglese
schedaincarico v. 1.8.3 / 1.8.3
Area Servizi ICT
09/12/2023