MULTIMEDIA DATA MODELLING

Academic Year 2023/2024 - Teacher: LUCA GUARNERA

Expected Learning Outcomes

General educational objectives of teaching in terms of expected learning outcomes:

Knowledge and understanding: the objective of the course is for students to acquire knowledge that enables them to understand basic concepts about digital images and videos, as well as their processing for accurate analysis of low-level discriminative features useful for solving common tasks (face recognition; face detection; etc.)
Ability to apply knowledge and understanding (applying knowledge and understanding): the student will acquire the necessary skills for processing and analyzing multimedia content. In this regard, part of the course will consist of laboratory lectures, with practical examples of programs written in Python language and use of ad-hoc libraries (mainly OpenCV)
Autonomy of judgment (making judgements): The student will be able to independently develop solutions in writing algorithmic solutions for the analysis of mutlimedia content.
Communication skills (communication skills): the student will acquire the main skills in the use of technical language in the general area of digital images and video and the basic concepts of computer vision.
Learning skills: the course aims, as an objective, to provide the student with the necessary theoretical and practical methodologies to be able to address and solve independently new problems that may arise during a work activity. To this end, various topics will be covered in class involving the student in the search for possible solutions to real problems.

Course Structure

Didattica frontale

Required Prerequisites

Basic knowledge of the Python language
Basic concepts of Machine Learning

Attendance of Lessons

Attendance is recommended

Detailed Course Content

The first part of the course is about digital images:

Introduction to digital images + Lab. Session
Interpolation operations: replication, bilinear and bicubic + Lab. Session
Space domain and Frequency domain + Lab. Session
Fourier and DCT Transform + Lab. Session
The convolution and convolution theorem
Lossy and lossless compression
The JPEG standard
Mathematical morphology applied to digital images.
Mathematical morphology applied to gray-scale images + Lab. Session
Image restoration and Noise models + Lab. Session
Filters: arithmetic, geometric, harmonic and counter-harmonic mean + Lab. Session
Median, minimum, maximum, midpoint, + Lab. Session
Adaptive filters + Lab. Session
Periodic noise. Noise removal in the frequency domain + Lab. Session
Filtering in the spatial domain. Edge detector. Canny's algorithm + Lab. Session
Steganography and Steganalysis + Lab. Session

The second part of the course is about digital video:

Introduction to digital video and main definition: aspect ratio, resolution, video file format.
Lab. Session: OpenCV and Digital Video
Video formats (MPEG-1, MPEG-2, MPEG-4, H.264).

The third part of the course covers Low-Level Vision:

Low-Level Vision: Filters and Features: Edges, Textures, Laplacian Pyramid, Corner Detection (Harris, ...), SIFT.
Computer Vision applications: face detection and recognition, etc..

The fourth part of the course concerns modeling and processing of digital data:

Data modeling (features extracted from multimedia contents) and classification tasks

Laboratory Session:

The python language applied to digital image and video processing
Introduction to OpenCV and other image/video processing libraries
Implementation of Computer Vision algorithms (studied in the theoretical part)

Several seminars will be scheduled during the course on different topics related to the topics covered in the course, such as "Hints on Multimedia Content Manipulation and Detection Techniques: from Image Forgery to Deepfakes"

Textbook Information

Digital Image Processing, (3rd Edition) Rafael C. Gonzalez, Richard E. Woods, Ediz. Pearson, Prentice Hall

G. Bradski, A. Kaehler, “Learning OpenCV Computer Vision with the OpenCV Library” O'Reilly Media, 2008;

Mubarak Shah, "Fundamentals of Computer Vision" (pdf), 1997

Richard Szeliski, Computer Vision: Algorithms and Application, Springer 2010

Course Planning

	Subjects	Text References
1	First module: Introduction to Digital Images	Teacher's material
2	Second Module: Introduction to Digital Video	Teacher's material
3	Third module: Introduction to Computer Vision	Teacher's material
4	Fourth module: Modeling and Analysis of Multimedia Data	Teacher's material

Learning Assessment

Learning Assessment Procedures

Written test (Multiple choice test) and Project

Examples of frequently asked questions and / or exercises

1. What is a digital image?

a) A photograph taken with an analogue camera

b) A matrix of light intensity values

c) A compressed video

d) A hand-drawn image

2. What are the types of images?

a) Black and White, Gray and Blue

b) Gray and Green, Color, Pink and Blue

c) Black and White, Greyscale, Color

d) Yellow and Red, Green and Blue, Color

3. What does the RGB color space represent?

a) A series of abstract coordinates

b) A color representation based on red, yellow and blue

c) The gray line

d) A color representation based on red, green and blue

4. What is the spatial resolution of an image?

a) The number of colors present in the image

b) The number of information points (pixels) in the image

c) The physical size of the image

d) The speed at which the image is displayed

5. What are the three main types of frames in videos?

a) I, P, L

b) I, Q, B

c) I, P, B

d) I, X, Z

VERSIONE IN ITALIANO

Degree Course in

Data Science