Predicting Wine Quality with Random Forest and Scikit-Learn

By: Coursera

Classification problems are common in practice: deciding whether an email is spam, whether a credit card transaction is fraudulent, or which label a mobile phone should assign to the image in its camera view, such as a flower, a dog or a person. Machine learning techniques are well suited to problems like these.

In this guided project, we will tackle the problem of predicting red wine quality using a Random Forest Classifier. We will implement it in Python with the RandomForestClassifier provided by the Scikit-Learn package.
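As a rough illustration of what training and scoring such a classifier can look like, here is a minimal sketch. It is not the project's notebook. The data is synthetic, generated with make_classification as a stand-in for the wine table, and the number of columns, the number of classes, the split ratio and n_estimators are all arbitrary assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in data (assumption): 11 numeric predictors, 3 classes.
# A real run would load the wine measurements and quality labels instead.
X, y = make_classification(
    n_samples=600,
    n_features=11,
    n_informative=6,
    n_redundant=3,
    n_classes=3,
    random_state=0,
)

# Hold out 25% of the rows for evaluation, keeping class proportions equal.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Fit the forest; n_estimators=200 is an illustrative choice, not a tuned one.
clf = RandomForestClassifier(n_estimators=200, random_state=0)
clf.fit(X_train, y_train)

# Score the held-out rows.
y_pred = clf.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)
print(f"Test accuracy: {accuracy:.3f}")
```

Fixing random_state makes the split and the forest reproducible, which helps when comparing later changes to the model.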

You will learn to train the classifier, calibrate it, tune its hyperparameters and evaluate the accuracy of its predictions. You will also learn how to perform cluster analysis on the predictors to handle collinearity and reduce their number without sacrificing model accuracy. In addition, you will draw various plots to help you interpret the results.
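One common way to use cluster analysis against collinearity is to cluster the predictors hierarchically on their rank correlations and keep one representative per cluster. The sketch below follows that idea. It is an assumption about the general approach, not a transcript of the project, and it again uses synthetic data. The 0.5 distance threshold, the Ward linkage and the choice of the first member of each cluster as its representative are illustrative.

```python
from collections import defaultdict

import numpy as np
from scipy.cluster import hierarchy
from scipy.spatial.distance import squareform
from scipy.stats import spearmanr
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

# Synthetic stand-in data (assumption) with some deliberately redundant columns.
X, y = make_classification(
    n_samples=600, n_features=11, n_informative=6, n_redundant=3,
    n_classes=3, random_state=0,
)

# Spearman rank correlation between every pair of predictors.
corr = spearmanr(X).correlation
corr = (corr + corr.T) / 2          # enforce exact symmetry
np.fill_diagonal(corr, 1.0)

# Turn correlation into a distance: strongly correlated columns are close.
distance = 1.0 - np.abs(corr)
linkage = hierarchy.ward(squareform(distance))

# Cut the tree at an illustrative threshold to get one cluster id per column.
cluster_ids = hierarchy.fcluster(linkage, t=0.5, criterion="distance")

# Keep the first column of each cluster as its representative.
clusters = defaultdict(list)
for column, cluster_id in enumerate(cluster_ids):
    clusters[cluster_id].append(column)
selected = sorted(members[0] for members in clusters.values())
X_reduced = X[:, selected]

# Compare cross-validated accuracy with all columns and with the reduced set.
model = RandomForestClassifier(n_estimators=100, random_state=0)
score_full = cross_val_score(model, X, y, cv=3).mean()
score_reduced = cross_val_score(model, X_reduced, y, cv=3).mean()
print(f"kept {len(selected)} of {X.shape[1]} predictors")
print(f"accuracy, all predictors: {score_full:.3f}")
print(f"accuracy, reduced set:    {score_reduced:.3f}")
```

Raising the threshold merges more predictors into each cluster and keeps fewer columns, so the comparison of the two scores is what indicates whether a given cut costs accuracy.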

This project is intended for beginners. The prerequisites are basic knowledge of Python, Pandas, NumPy, Matplotlib, Seaborn, Scikit-Learn, SciPy and the Random Forest algorithm.

Note: This course runs in the virtual browser of Rhyme, Coursera's hands-on project platform. From this browser you will connect to Google Colaboratory to write and execute Python code in a Jupyter Notebook, without having to install any software. All you need is a Google account.

This Guided Project was created by a Coursera community member.