Machine Learning on a Cancer Dataset - Part 15

in #machine-learning8 years ago

In this final video on Decision Tree, we recapitulate the advantages and disadvantages of this classifier for machine learning purposes. To be more specific, we've been using Decision Trees in scikit-learn to learn to classify tumors into benign or malignant.

We also look at the parameters that can be used/tuned to optimize this classifier so that it can lead to better decision making.

In the next iteration, we'll be looking into ensembles of trees as classifiers. These algorithms are often used as a better alternative to decision trees because they are less 'prone' to overfitting.


As a reminder:

In this series I'm going to explore the cancer dataset that comes pre-loaded with scikit-learn. The purpose is to train the classifiers on this dataset, which consists of labeled data: ~569 tumor samples, each labeled malignant or benign, and then use them on new, unlabeled data.


Previous videos in the series:

  1. Machine Learning on a Cancer Dataset - Part 11
  2. Machine Learning on a Cancer Dataset - Part 12
  3. Machine Learning on a Cancer Dataset - Part 13
  4. Machine Learning on a Cancer Dataset - Part 14


To stay in touch with me, follow @cristi

#machine-learning #science #python


Cristi Vlad, Self-Experimenter and Author

Coin Marketplace

STEEM 0.15
TRX 0.16
JST 0.028
BTC 67340.80
ETH 2419.68
USDT 1.00
SBD 2.35