Outliers are observations that lie an abnormal distance from other values in a random sample from a population. They’re usually harmful to most data analysis procedures because they can skew your dataset and give misleading results.
In Python, you can identify and replace these outliers with the 5th and 95th percentile values. This tutorial will guide you on how to go about it.
Continue reading