Machine Learning on a Cancer Dataset - Part 1

in #machine-learning, 7 years ago

In this series I'm going to explore the cancer dataset that comes pre-loaded with scikit-learn. Scikit-learn is a machine learning library for Python, and it ships with a few datasets that you can practice on to get started.
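Here is a minimal sketch of what loading a bundled dataset looks like. It assumes the dataset in question is the one returned by scikit-learn's `load_breast_cancer` function; the variable name `cancer` is just my choice for illustration.

```python
from sklearn.datasets import load_breast_cancer

# Assumption: the "cancer dataset" is the one returned by load_breast_cancer.
cancer = load_breast_cancer()

# The returned object behaves like a dictionary; list what it contains.
print(cancer.keys())

# Feature matrix: one row per tumor sample, one column per measurement.
print(cancer.data.shape)

# Class names corresponding to the integer labels in cancer.target.
print(cancer.target_names)
```

Nothing has to be downloaded or cleaned here: the data ships with the library and is already in numeric form.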

The advantage is that you don't have to go through the hurdles of data cleaning and processing (wrangling), which is a very burdensome process, as many data scientists know well. With a clean dataset, we can focus on applying different ML algorithms, classifiers in this case.

The purpose is to train the classifiers on this dataset, which consists of labeled data (~500 tumor samples, each labeled malignant or benign), and then use them on new, unlabeled data. In this first introductory video, I explain a bit more.
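The train-then-predict idea can be sketched roughly as follows. This is only an illustration under a few assumptions: the dataset is loaded with `load_breast_cancer`, a k-nearest neighbors classifier stands in for "a classifier" (it is not necessarily the one used later in the series), and a held-out split of the labeled data plays the role of new, unseen samples.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Assumption: the "cancer dataset" is the one returned by load_breast_cancer.
cancer = load_breast_cancer()

# Hold back part of the labeled data to stand in for new, unseen samples.
X_train, X_test, y_train, y_test = train_test_split(
    cancer.data, cancer.target, stratify=cancer.target, random_state=42
)

# Illustrative choice of classifier; any scikit-learn classifier
# follows the same fit / predict pattern.
clf = KNeighborsClassifier(n_neighbors=5)
clf.fit(X_train, y_train)

# Predict labels for the held-out samples, then compare with the true labels.
predictions = clf.predict(X_test)
accuracy = clf.score(X_test, y_test)
print(f"Accuracy on held-out samples: {accuracy:.3f}")
```

Because the held-out samples still have their labels, we can check how often the classifier is right before trusting it on data that is truly unlabeled.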



To stay in touch with me, follow @cristi

#machine-learning #science #python


Cristi Vlad, Self-Experimenter and Author


hello...how can i contact u..i would like some advice on ML

send me a message on youtube or my facebook page.
