Class imbalance is something which most of all the classification problem falls on. It is always good to check the number of observations for each target variable. To be precise, it is something like we get 990 cancer free patients and 10 cancer patients in the data set. While machine will learn a lot about those 990 cancer free patients. But high importance is for those 10 predictions.