greends-pml

Links and exercises for the course Practical Machine Learning, Green Data Science, 2o semester 2024/2025


Instructor: Manuel Campagnolo, ISA/ULisboa

The course will follow a mixed flipped classroom model, where students are supposed to work on suggested topics autonomously before classes. Work outside class will be based on a range of Machine Learning resources including the book Sebastian Raschka, Yuxi (Hayden) Liu, and Vahid Mirjalili. Machine Learning with PyTorch and Scikit-Learn. Packt Publishing, 2022. During classes, Python notebooks will be run on Google Colab.

Links for class resources:

Overview notebook This notebook provides an overview of the full course and contains pointers for other sources of relevant information and Python scripts.


Sessions: Each description below includes the summary of the topics covered in the session, as well as the description of assignments and links to videos or other materials that students should work through.

Introduction (Feb 21, 2025)

The goal of the first class is to give an introduction to ML and also to show some of the problems that can be addressed with the techniques and tools that will be discussed during the semester. The examples will be run on Colab.

Basic concepts (Feb 28, 2025): model, loss, fit, learning rate, perceptron, ...

The goal of the following classes is to understand how ML models can be trained in and used to solve regression and classification problems. We start by applying the machine learning approach to well-known statistical problems like linear regression to illustrate the stepwise approach followed in ML. We use synthetic data generated from a linear or quadratic regression, where one can control the underlying model and the amout of noise. Then, we consider the Iris tabular data set with 4 explanatory variables and categorical label that can be one of three species.


Pseudo code for SGD (stochastic gradient descent) to fit a linear regression:



Main on-line resources