Machine Learning on a Cancer Dataset - Part 16

in #machine-learning7 years ago

In this machine learning video, we're gonna start exploring Random Forests in scikit-learn.

Random Forests are algorithms that can be used for both classification and regression tasks. We're gonna used them for classification. Most simply put, random forests are collections of decision trees (which we've explored in previous videos) with the caveat that they may be more advantageous due to the randomization that is applied for each tree comprising the forest.

In this specific video, I discus a few introductory concepts of random forests and then implement a classifier to our cancer dataset. For the detailed walkthrough, please see the video below.


As a reminder:

In this series I'm going to explore the cancer dataset that comes pre-loaded with scikit-learn. The purpose is to train the classifiers on this dataset, which consists of labeled data: ~569 tumor samples, each labeled malignant or benign, and then use them on new, unlabeled data.


Previous videos in this series:

  1. Machine Learning on a Cancer Dataset - Part 11
  2. Machine Learning on a Cancer Dataset - Part 12
  3. Machine Learning on a Cancer Dataset - Part 13
  4. Machine Learning on a Cancer Dataset - Part 14
  5. Machine Learning on a Cancer Dataset - Part 15


To stay in touch with me, follow @cristi


Cristi Vlad, Self-Experimenter and Author

Coin Marketplace

STEEM 0.20
TRX 0.12
JST 0.028
BTC 64453.36
ETH 3507.44
USDT 1.00
SBD 2.56