(Re)Launching my supervised machine learning course

By Julia Silge

September 23, 2019

Today I am happy to announce a new(-ish), free, online, interactive course that I have developed, Supervised Machine Learning: Case Studies in R! 💫

Supervised machine learning in R

Predictive modeling, or supervised machine learning, is a powerful tool for using data to make predictions about the world around us. Once you understand the basic ideas of supervised machine learning, the next step is to practice your skills so you know how to apply these techniques wisely and appropriately. In this course that I have developed, you can work through four case studies using data from the real world; the course focuses on exploratory data analysis, preparing data so it is ready for predictive modeling, training supervised machine learning models, and evaluating those models.

This course approaches supervised machine learning using:

The interactive course site is built on the amazing framework created by Ines Montani, originally built for her spaCy course. The front-end is powered by Gatsby and Reveal.js and the back-end code execution uses Binder. It can take a little while for Binder to spin up a new Docker container the first time you run a code exercise in a new session, so be patient when that happens! ⏳

Florencia D’Andrea helped port the course materials and made the fun logo; I am quite happy that I hired Florencia to work with me on this. You can find the code that builds my course on GitHub. To learn more about building a course on this framework, see Ines’s starter repos for making courses in Python and R, and her explanation of how the framework works at the original course repo.

A little context

If you have been following news in the data science world over the past months, you have probably guessed that this is a revamp of one of my DataCamp courses. The contract I have with DataCamp does not allow me to have my courses removed from that platform (although they have removed my name from my courses, not at my request), but I am allowed to republish the content elsewhere.

One difference from this course’s previous incarnation is that there is no video component (I do not own any rights to the videos). I will be interested to see how people respond to this change. I know the videos can be fun and people respond to them. At the same time, there is a growing body of research and thought around the limitations of video in online learning environments. We’ll see what learners gain from this version of the course.

One benefit of having more influence on the content of the course now is that I can update it much more easily. My plan is to move the content from caret completely over to tidymodels as the tidymodels infrastructure continues to mature. Probably about when I move my IRL workflow over!

Try it out

If you want to practice your ML skills, give the course a go. 🚀 Contributions and comments on how to improve this course are welcome! Please file an issue or submit a pull request if you find something that could be fixed or improved.

Posted on:
September 23, 2019
Length:
3 minute read, 534 words
Tags:
rstats
See Also:
Topic modeling for #TidyTuesday Spice Girls lyrics
Predicting viewership for #TidyTuesday Doctor Who episodes
Spatial resampling for #TidyTuesday and the #30DayMapChallenge