lectures2024

Machine learning preparatory week @PSL

Discord server

The afternoons are dedicated to practical sessions using Python. Students will work on their own under light supervision from the teachers. Students can use the Discord server chat to communicate, share information, code and data, and help each other during these sessions.

These practical sessions require Python 3 with the standard SciPy ecosystem, scikit-learn and PyTorch. They make use of Jupyter notebooks. The easiest way to proceed is to have a Gmail account and run the notebooks remotely on Google Colab.
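If you prefer to work locally rather than on Colab, a quick sanity check along the following lines may help confirm that the required libraries can be imported. This is only a sketch: the list of import names (`numpy`, `scipy`, `matplotlib`, `sklearn`, `torch`) is an assumption about what "the standard SciPy ecosystem" covers here, and `check_packages` is a hypothetical helper, not part of the course material.

```python
import importlib

# Assumed import names for the libraries mentioned above; adjust as needed.
REQUIRED = ["numpy", "scipy", "matplotlib", "sklearn", "torch"]


def check_packages(names):
    """Return a dict mapping each name to its version string, or None if it cannot be imported."""
    found = {}
    for name in names:
        try:
            module = importlib.import_module(name)
        except Exception:
            # A missing package raises ImportError; a broken install may raise something else.
            found[name] = None
        else:
            # Not every module exposes __version__, so fall back to a placeholder.
            found[name] = getattr(module, "__version__", "unknown")
    return found


if __name__ == "__main__":
    for name, version in check_packages(REQUIRED).items():
        print(f"{name:12s} {version or 'MISSING'}")
```

Any package reported as `MISSING` would need to be installed before the session, or you can fall back to Colab, where these libraries are typically preinstalled.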

Expected Program

This program is a first draft and may change. The basic pattern is a course in the morning and labs in the afternoon. Check with the teachers for updates on the first day. You must bring your own computer for the practical sessions.

Day 1 (Wednesday August 28, 2024):

Day 2 (Thursday August 29, 2024):

Day 3 (Friday August 30, 2024):

Day 4 (Monday September 2, 2024):

Day 5 (Tuesday September 3, 2024):

Lectures

Machine learning part

  1. Machine learning: history, application, successes
  2. Introduction to machine learning
  3. Supervised machine learning models
  4. Scikit-learn: estimation and pipelines
  5. Optimization for linear models
  6. Optimization for machine learning
  7. Deep learning: convolutional neural networks
  8. Unsupervised learning

Spark and Machine Learning

Slides from Dario Colazzo

Practical works

Links open Colab notebooks. You may also clone this repository and work locally.

  1. Wednesday: Python basics and the Corrected notebook
  2. Thursday: Practice of Scikit-learn
  3. Friday: Optimization and the Corrected notebook
  4. Monday: Classification with PyTorch and GPUs

Teachers

Acknowledgements

The slides and notebooks were originally written by Pierre Ablin, Mathieu Blondel and Arthur Mensch.

Some material of this course was borrowed and adapted:

License

All the code in this repository is made available under the MIT license unless otherwise noted.

The slides are published under the terms of the CC BY 4.0 license.