DATA ANALYSIS AND STATISTICAL LEARNING
Modulo STATISTICAL LEARNING

Academic Year 2023/2024 - Docente: Salvatore INGRASSIA

Risultati di apprendimento attesi

Il modulo fornisce conoscenze su: i) il problema dell'apprendimento statistico del modello di apprendimento da dati empirici; ii) principali tecnici di apprendimento statistico per regressione e classificazione. 

Course Structure

Lectures and practical data modeling in R.

Required Prerequisites

Main topics in algebra, calculus, geometry, probability (at bachelor level).

Attendance of Lessons

In person.

Detailed Course Content

Statistical Learning. Estimation of dependences based on empirical data. Supervised and Unsupervised Learning. Regression and Classification problems. Parametric and non-parametric models. Assessing Model Accuracy.

Linear Regression. Simple linear regression. Multiple linear regression. Least squares criterion and parameter estimation. Assessing the accuracy of the coefficient estimates and of the model. Use of qualitative predictors. Extension of the linear model and non-linear relationships.

Statistical Learning. Estimation of dependences based on empirical data. Supervised and Unsupervised Learning. Regression and Classification problems. Parametric and non-parametric models. Assessing Model Accuracy.

Linear Regression. Simple linear regression. Multiple linear regression. Least squares criterion and parameter estimation. Assessing the accuracy of the coefficient estimates and of the model. Use of qualitative predictors. Extension of the linear model and non-linear relationships.

Classification. Logistic regression; parameter estimation. Linear and quadratic discriminant analysis.

Resampling methods. Cross-validation, Bootstrap.

Linear model selection and regularization. Subset selection. Shrinkage methods. Dimension Reduction Methods, Modeling in high dimensions.

Tree-based Methods. Regression Trees and Classification Trees. Bagging, Random Forest, Boosting

Textbook Information

1. James G., Witten D., Hastie T., Tibshirani R. (2023). An Introduction to Statistical Learning with Applications in R, 2nd Edition, Springer, New York, https://hastie.su.domains/ISLR2/ISLRv2_corrected_June_2023.pdf

2. Hastie T., Tibshirani R., Friedman (2008). The Elements of Statistical Learning, Springer, New York

3. Course notes


AutoreTitoloEditoreYearISBN
James G., Witten D., Hastie T., Tibshirani R. An Introduction to Statistical Learning with Applications in RSpringer2021
Hastie T., Tibshirani R., Friedman The Elements of Statistical LearningSpringer2008

Course Planning

 SubjectsText References
1Basics of statistical learning.Textbook n.1, chap. 1
2Linear RegressionTextbook n.1, chap. 2
3ClassificationTextbook n.1, par. 4.1-4.5
4Resampling methodsTextbook n.1, chap.5
5Linear Model Selection and RegularizationTextbook n.1, chap. 6
6Tree-based methodsTextbook n.1, chap. 8

Learning Assessment

Learning Assessment Procedures

The evaluation will be based on a data analysis report  and oral exam.

Examples of frequently asked questions and / or exercises

See the course content.
ENGLISH VERSION