Machine Learning.

Posts

Handle Imbalanced Dataset

- July 26, 2020

Handle Imbalanced Dataset (along with implementation in Python!) Let's take the example of a cancer patient dataset, where we check whether a person has cancer based on the input features. Suppose our dataset has 1000 records, of which 900 belong to patients with cancer and the remaining 100 to patients without cancer. This is clearly an imbalanced dataset, since it has far more rows for people with cancer than for people without. If we train a model on this imbalanced data and later test it on new data, the model will be heavily biased towards predicting cancer, so its accuracy on non-cancer patients will be very low. So how do we handle an imbalanced dataset? Let's look at some great techniques to avoid this kind of problem and train our model in a more precise way. Now, we will use a couple of techniques to resolve this imbalanced datas...
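To make the bias concrete, here is a minimal sketch of the 900/100 split described above. The labels are synthetic, and the "always predict the majority class" baseline is a hypothetical stand-in for a badly biased model, not the method used in the post. It shows how overall accuracy can look fine while every non-cancer patient is misclassified.

```python
from collections import Counter

# Synthetic labels matching the example: 900 cancer (1), 100 non-cancer (0).
labels = [1] * 900 + [0] * 100

# Count rows per class to confirm the imbalance.
counts = Counter(labels)
majority_class = counts.most_common(1)[0][0]

# Hypothetical baseline: a model so biased that it always predicts
# the majority class, whatever the input features are.
predictions = [majority_class] * len(labels)

# Overall accuracy: fraction of all records predicted correctly.
accuracy = sum(p == y for p, y in zip(predictions, labels)) / len(labels)

# Recall on the minority (non-cancer) class: fraction of non-cancer
# records that the baseline actually identifies as non-cancer.
minority_correct = sum(1 for p, y in zip(predictions, labels) if p == 0 and y == 0)
minority_recall = minority_correct / counts[0]

print("class counts:", dict(counts))
print("overall accuracy:", accuracy)
print("non-cancer recall:", minority_recall)
```

This is why it helps to look at per-class metrics, and not only overall accuracy, when judging a model trained on imbalanced data.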
