{"id":94,"date":"2022-09-15T14:05:00","date_gmt":"2022-09-15T08:35:00","guid":{"rendered":"https:\/\/www.aclysis.com\/blog\/?p=94"},"modified":"2024-05-10T16:22:11","modified_gmt":"2024-05-10T10:52:11","slug":"outliers-and-their-treatment","status":"publish","type":"post","link":"https:\/\/www.aclysis.com\/blog\/?p=94","title":{"rendered":"Outliers And Their Treatment"},"content":{"rendered":"\n<figure class=\"wp-block-image size-large\"><img fetchpriority=\"high\" decoding=\"async\" width=\"1024\" height=\"538\" src=\"https:\/\/www.aclysis.com\/blog\/wp-content\/uploads\/2022\/09\/1662720845753-1024x538.jpg\" alt=\"\" class=\"wp-image-97\" srcset=\"https:\/\/www.aclysis.com\/blog\/wp-content\/uploads\/2022\/09\/1662720845753-1024x538.jpg 1024w, https:\/\/www.aclysis.com\/blog\/wp-content\/uploads\/2022\/09\/1662720845753-300x158.jpg 300w, https:\/\/www.aclysis.com\/blog\/wp-content\/uploads\/2022\/09\/1662720845753-768x403.jpg 768w, https:\/\/www.aclysis.com\/blog\/wp-content\/uploads\/2022\/09\/1662720845753.jpg 1280w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">OUTLIERS<br>An outlier is a data point that is extreme, lying far from or differing markedly from the rest of the data points. Outliers typically arise from erroneous entries caused by human or system error, erratic production processes, malfunctioning machinery, etc. The presence of outliers can indicate irregularities in the system or process. In large datasets, outliers are expected and common. A box plot is a popular tool for finding outliers in a dataset.&nbsp;&nbsp;<br><br><br>A box plot is a five-point plot showing the first quartile (Q1), the second quartile \/ median (Q2), the third quartile (Q3), the upper whisker limit (Q3 + 1.5 \u00d7 IQR) and the lower whisker limit (Q1 \u2013 1.5 \u00d7 IQR), where IQR (interquartile range) is Q3 \u2013 Q1. 
Any data point outside the whisker limits is treated as an outlier.&nbsp;&nbsp;<br><img decoding=\"async\" src=\"https:\/\/media-exp1.licdn.com\/dms\/image\/D5612AQEo6x2buNCQuw\/article-inline_image-shrink_1500_2232\/0\/1662719441923?e=1669248000&amp;v=beta&amp;t=5R3Yu7RczFtfW6BflActo7mM9tKNQVc6RYh_DKTNe4M\" alt=\"No alt text provided for this image\"><br><img decoding=\"async\" src=\"https:\/\/media-exp1.licdn.com\/dms\/image\/D5612AQE25WJrIlbdsw\/article-inline_image-shrink_1500_2232\/0\/1662719746214?e=1669248000&amp;v=beta&amp;t=l4Trp_0nH32tQXohGzV3FWKvuzD_2CutIcE8LxC1k3M\" alt=\"No alt text provided for this image\"><br>Box plot without outliers<br><img decoding=\"async\" src=\"https:\/\/media-exp1.licdn.com\/dms\/image\/D5612AQHRxZlQpaYXfA\/article-inline_image-shrink_1500_2232\/0\/1662719547637?e=1669248000&amp;v=beta&amp;t=5iLX_FwlxdC7Z4aGmzjoLSjD1D77uoe0tN8KEyjT7qY\" alt=\"No alt text provided for this image\"><br>Box plot with outliers<br><a rel=\"noreferrer noopener\" href=\"https:\/\/www.skillgain.in\/contact.php\" target=\"_blank\"><\/a><br>If outliers make up a significant share of a dataset, either overall or in a particular feature, we should treat them before model building, as many statistical measures, such as the mean and standard deviation, are sensitive to these extreme values. For building regression models and forecasts, outlier treatment is a must. For classification, outliers generally have less impact, as those models are built on the principle of similarity or distance.&nbsp;<br>Outlier treatment means capping the values in the dataset within the whisker limits (the capping limits may be set differently according to business needs). 
This is done by replacing the high and low extreme values with the upper and lower whisker limits, respectively.&nbsp;<br><\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image size-large\"><a href=\"https:\/\/www.aclysis.com\/contact.php\"><img decoding=\"async\" width=\"1024\" height=\"280\" src=\"https:\/\/www.aclysis.com\/blog\/wp-content\/uploads\/2022\/09\/image-4-1024x280.png\" alt=\"\" class=\"wp-image-103\" srcset=\"https:\/\/www.aclysis.com\/blog\/wp-content\/uploads\/2022\/09\/image-4-1024x280.png 1024w, https:\/\/www.aclysis.com\/blog\/wp-content\/uploads\/2022\/09\/image-4-300x82.png 300w, https:\/\/www.aclysis.com\/blog\/wp-content\/uploads\/2022\/09\/image-4-768x210.png 768w, https:\/\/www.aclysis.com\/blog\/wp-content\/uploads\/2022\/09\/image-4.png 1280w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/a><\/figure>\n\n\n\n<p>Image courtesy: https:\/\/help.ezbiocloud.net, https:\/\/justinsighting.com<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Image courtesy: https:\/\/help.ezbiocloud.net, 
https:\/\/justinsighting.com<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_eb_attr":"","footnotes":""},"categories":[41],"tags":[],"class_list":["post-94","post","type-post","status-publish","format-standard","hentry","category-exploratory-data-analysis"],"_links":{"self":[{"href":"https:\/\/www.aclysis.com\/blog\/index.php?rest_route=\/wp\/v2\/posts\/94","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.aclysis.com\/blog\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.aclysis.com\/blog\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.aclysis.com\/blog\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.aclysis.com\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=94"}],"version-history":[{"count":6,"href":"https:\/\/www.aclysis.com\/blog\/index.php?rest_route=\/wp\/v2\/posts\/94\/revisions"}],"predecessor-version":[{"id":578,"href":"https:\/\/www.aclysis.com\/blog\/index.php?rest_route=\/wp\/v2\/posts\/94\/revisions\/578"}],"wp:attachment":[{"href":"https:\/\/www.aclysis.com\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=94"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.aclysis.com\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=94"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.aclysis.com\/blog\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=94"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}