Please complete the following steps before the workshop begins on Saturday, January 15. The goal is to set up, and become familiar with, a toolkit suited to machine learning and to extracting knowledge from data sets.
Revisit what you learnt about linear and logistic regression in the Workshop on Statistical Methods in November. What is the difference between linear and logistic regression?
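As a starting point for that comparison, here is a minimal sketch in plain Python, not taken from the November workshop material. It assumes a single input feature and uses made-up parameters `w` and `b` (nothing is fitted to data). Both models compute the same linear score `w * x + b`; logistic regression additionally passes that score through the sigmoid function, so its output can be read as a probability between 0 and 1.

```python
import math


def linear_predict(x, w, b):
    """Linear regression prediction: an unbounded real number."""
    return w * x + b


def logistic_predict(x, w, b):
    """Logistic regression prediction: the sigmoid of the linear score,
    which always lies strictly between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-(w * x + b)))


# Hypothetical parameters, chosen only for illustration.
w, b = 2.0, -1.0

for x in (-3.0, 0.5, 3.0):
    print(x, linear_predict(x, w, b), round(logistic_predict(x, w, b), 3))
```

The linear output is typically used to predict a continuous quantity, while the logistic output is typically thresholded (for example at 0.5) to assign one of two classes.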
Familiarize yourself with scikit-learn by completing the two (small) tasks in the following Jupyter notebook (also accessible here). You can use either Google Colab or your local Python environment. The following class might be helpful: