|
Academic Year |
2024/2025 |
Name |
Dott. - MI (1383) Ingegneria Meccanica / Mechanical Engineering |
Programme Year |
1 |
ID Code |
058880 |
Course Title |
STATISTICS IN THE BIG DATA ERA |
Course Type |
MONO-DISCIPLINARY COURSE |
Credits (CFU / ECTS) |
5.0 |
Course Description |
In the heart of industry 4.0 revolution is the aspect of big data. In this course we will view how big data affect the existing statistical methods, recognize the issues and propose tools that are capable to overcome the problems caused by the growth in the data size & dimension. We will start by recognizing the different types of big data (tall, wide, asynchronous, unstructured etc.) and we will present historic big data failures to acknowledge the challenges. We will present visualization principles, show platforms that facilitate visualization of complex data, talk about the concepts of statistical versus the practical significance and work on the issues of multiple hypothesis testing and correction. Statistical principles of data reduction (sufficiency and likelihood) will be provided. Within the supervised statistical learning, the descriptive versus predictive modeling approach will be discussed highlighting the differences between machine learning and statistics. Next we will focus into regression where we will talk about variable/model selection aspects along with shrinkage/regularization (like ridge, lasso etc.) and other penalizing methods. This material will be extended to generalized linear models and discriminant analysis as well. Within unsupervised statistical learning the topics of principal components analysis and cluster analysis will be covered. During the course big data from real studies will be used to present the material and students will be motivated to use data from their own research area to work on the various topics taught in class. |
Scientific-Disciplinary Sector (SSD)
|
--
|
Alphabetical group
|
Name
|
Teaching Assignment Details
|
From (included)
|
To (excluded)
|
A
|
ZZZZ
|
Tsiamyrtzis Panagiotis
|
|
|