PSL Preparatory Weeks


Preparation courses on machine learning, 2023

This year, the transverse program for data sciences of PSL proposes two preparatory weeks :

  • The preparatory week on basics in math and data science (starting the 21th of August). The students will follow it remotely and autonomously. For each part, video lectures, exercise sheets and lab sessions (Python notebooks) will be provided.
  • The preparatory week on machine learning, bigdata, and ethics (from the 30th of August to the 6th of September 2023)

Both weeks are open to all master and PhD students from PSL. Subject to availabilities, students from other universities can also attend. The registration is free but mandatory. See the two links below, one for each week.

Preparatory week on basics (remote)

This first preparatory week will “start” on Monday, August 29. By the way you can start before: all you need is your moodle account (see the section on registration). The students will follow it remotely and autonomously. For each of the five parts, video lectures, exercise sheets and lab sessions under the form of Python notebooks will be provided. For each part, the theoretical content (lectures and exercises) should represent about a half day of work (around four hours), and the lab session also.

It is mandatory to follow this week before the advanced one !!

Pre-registration

You can pre-register here to the course.

PSL students have priority if they pre-register before August 15th.

To have access to the moddle, you need to register before August the 20th.

Expected roadmap

With this organization in mind, the schedule is as follows :

  • Reza Hatami: Functions and sequences
  • Gwendoline de Bie: Basics of linear algebra
  • Pierre Ablin Differential: calculus and PCA
  • Madalina Olteanu: Statistics
  • Pierre Senellart: Databases

Q&A sessions

In addition, sessions of questions and answers will be organized : the teachers of the first four parts will make themselves available through Teams in the afternoon, during two hours, and will answer the questions of the students.

Preparatory week on machine learning, bigdata and ethics

Dates and location

Dates: 30 August to 6 Sept. 2023.

Location: Paris Santé Campus

Pre-registration

You can pre-register here to the course.

PSL students have priority if they pre-register before August 15th.

Expected Program

This program is a first draft. It can change. The basic pattern is: course in the morning and labs in the afternoon. Check for updates with the teachers on the first day. You are required to bring your computers for the practical sessions.

Day 1 (Wednesday August 30, 2023):

  • 9:00–10:30: (course) Machine learning: recent successes.
  • 11:00-12:30: (course) Introduction to machine learning.
  • 14:00-17:30: (lab session) Introduction to Python and Numpy for data sciences.

Day 2 (Thursday August 31, 2023):

  • 9:00–10:30: (course) Machine learning models (linear, trees, neural networks).
  • 11:00-12:30: (course) Scikit-learn: estimation/prediction/transformation.
  • 14:00-17:30: (lab session) Practice of Scikit-learn.

Day 3 (Friday September 1, 2023):

  • 9:00-12:30: (course) The linear model, optimization
  • 14:00-17:30: (lab session) Logistic regression with gradient descent.

Day 4 (Monday September 4, 2023):

  • 9:00-10:30: (course) Introduction to Deep-Learning
  • 11:00-12:30 (lab session) Practical session

Day 5 (Tuesday September 5, 2023):

  • 9:00-10:30: (course) Introduction to unsupervised learning
  • 11:00-12:30 (course) Practical session
  • 14:00-16:00: (course/lab session) Spark for ML, part 1

Day 6 (Wednesday, September 6 2023):

  • 14:00-16:00: (course/lab session) Spark for ML, part 2

Course materials

Slides and suplementary materials for the lecture and numerics are available..

Practical information

The afternoon are dedicated to practical sessions using Python. Students will be on their own under a weak supervision from the teachers. Students can use the discord server chat to communicate, share information, codes, data and help each other during these session.

These practical sessions will necessitate the use of Python 3 with the standard Scipy ecosystem, Scikit-learn and Pytorch. They will make use of Jupyter notebooks. The easiest way to proceed is to have a gmail account and make use of a remote Google Colab to run the notebooks. If you are not confortable with this, or if you want to run the code locally, you need to install the requested python package, preferably using Anaconda.

Teachers