logo-polimi
Loading...
Risorse bibliografiche
Risorsa bibliografica obbligatoria
Risorsa bibliografica facoltativa
Scheda Riassuntiva
Anno Accademico 2017/2018
Scuola Scuola di Ingegneria Industriale e dell'Informazione
Insegnamento 091032 - AUDIO AND VIDEO SIGNALS
Docente Bestagini Paolo , Marcon Marco
Cfu 10.00 Tipo insegnamento Corso Integrato

Corso di Studi Codice Piano di Studio preventivamente approvato Da (compreso) A (escluso) Insegnamento
Ing Ind - Inf (Mag.)(ord. 270) - MI (474) TELECOMMUNICATION ENGINEERING - INGEGNERIA DELLE TELECOMUNICAZIONI*AZZZZ091032 - AUDIO AND VIDEO SIGNALS
091042 - VIDEO SIGNALS
091041 - AUDIO SIGNALS
Ing Ind - Inf (Mag.)(ord. 270) - MI (487) MATHEMATICAL ENGINEERING - INGEGNERIA MATEMATICA*AZZZZ095978 - AUDIO AND VIDEO SIGNALS

Programma dettagliato e risultati di apprendimento attesi

 This course is divided in two Modules: "Audio Signals", concerning fundamentals and advanced applications of audio and acoustic signal processing; and "Video Signals", focusing on image and video processing theory and applications.

 

"Audio Signals" Module

In this course module we first present the fundamental tools for analyzing, synthesizing and processing sounds (voice, music, acoustic signals). We then show how to use such tools for developing a wide range of applications, ranging from music information retrieval to acoustic array processing.

 

Fundamentals

  • physiology of the hearing system and psycho-perception of sound
  • elements of acoustics
  • basic audio analysis: time-frequency analyisi with Short-Time Fourier Transform or filterbanks
  • sound processing tools: filtering, nonlinear processing, auralization filters
  • adaptive filtering: Wiener-Hopf filter, steepest descent, LMS
  • space-time (array) processing: microphone arrays, beamforming
  • feature extraction and analysis
  • sound synthesis tools: modulated delay lines, tunable delay lines, digital waveguides

 

Applications

  • harmonic analysis: pitch tracking, vocoder, envelope tracking
  • sound modification: time warping (resampling), time and pitch scaling (tonal and rhythmic corrector)
  • modulated delay lines and sound effects for musical applications: flanger, chorus, distortions, etc.
  • sound reverberation: perceptual methods, physics-inspired methods, geometric methods (acoustic ray and beam tracing)
  • music information retrieval: feature-based analysis and classification of musical excerpts, playlist generation, mood extraction, etc.
  • adaptive sound processing and applications: echo cancellation, noise reduction, dereverberation, etc.
  • array processing: beamfornimg, acoustic source localization and extraction (demixing), acoustic room compensation

 

"Video Signals" Module

Visual information plays an important role in almost all areas of our life. Today, much of this information is represented and processed digitally. Digital image processing is ubiquitous, with applications ranging from television to tomography, from photography to printing, from robotics to remote sensing.

 

 "Video Signals" is a graduate-level introductory course to the fundamentals of digital image processing. It emphasizes general principles of image processing, rather than specific applications. We expect to cover topics such as

 

  • image sampling and quantization,
  • colors and colorimetry,
  • spatial and frequency operations,
  • image segmentation,
  • morphological image processing,
  • linear image filtering and correlation,
  • Image Spectral analysis based on Fourier transform,
  • noise reduction and restoration,
  • Deconvolution and blind deconvolution,
  • Image and Video Compression,
  • spatio-temporal sampling,
  • motion analysis and tracking,
  • multiple image stitching for panoramic images and videos,
  • High Dynamic Range Images.

 

Lectures will alternate with computer exercise sessions with MATLAB Image Processing Toolbox.


Note Sulla Modalità di valutazione

The grades of the two modules are independently given, and then averaged to a final grade.

 

"Audio Signals" Module: Students are required to take a written test. The resulting grade is truncated to 27/30 unless completed with a project (typically a report and/or a application developed in Matlab® or Python), which needs to be previously negotiated with and authorized by a course instructor. The final grade will be the average of the grades achieved in the written test and in the project.

 

"Video Signals" Module: Students are required to take a written test. Students can decide to improve their grade by developing a project, which needs to be previously negotiated with and authorized by a course instructor.

 

In order to pass the whole “Audio and Video Signals” exam (first and second module, for students that are not following just the Video module), a positive grade must be obtained in both modules and the final mark will be the average of the two marks rounded up the next integer. In evaluating the average between the two modules a 30 cum laude mark in a module will be considered as 30; in order to get 30 cum laude as a final mark, the mark of at least one module must be 30 cum laude.


In every exam date the two modules will take place the same day one after the other (more details will be provided by the web poliself); however students can choose to take on a single module or both modules in the same day. Once a student gets a positive mark in a module this mark will be automatically frozen until she/he takes again the same module in a following session.


Once a student gets a positive mark in both modules the final mark will be evaluated and then published for the publishing period on the web poliself ; at the end of this period it will be automatically recorded. If the student does not want to record that final grade he/she has to refuse it from the poliself: in that case both marks will be restored in a "frozen" state in order to allow the student to take on again a module (or both of them). However if the student has a frozen mark in both modules at the end of each exam to which his/her signed in, the final average mark will be automatically published and, after the publishing period, automatically recorded: so, if the student wants to improve the final mark he/she has to remember to refuse the published final grade.


Bibliografia

Software utilizzato
Nessun software richiesto

Mix Forme Didattiche
Tipo Forma Didattica Ore didattiche
lezione
60.0
esercitazione
40.0
laboratorio informatico
0.0
laboratorio sperimentale
0.0
progetto
0.0
laboratorio di progetto
0.0

Informazioni in lingua inglese a supporto dell'internazionalizzazione
Insegnamento erogato in lingua Inglese
Disponibilità di materiale didattico/slides in lingua inglese
Disponibilità di libri di testo/bibliografia in lingua inglese
Possibilità di sostenere l'esame in lingua inglese
Disponibilità di supporto didattico in lingua inglese
schedaincarico v. 1.7.2 / 1.7.2
Area Servizi ICT
03/07/2022