Tests to Identify Outliers in Data Series

This document includes several statistical tests to identify outliers in data series.
Downloads: 4.9K
Updated: 2010/8/18


There are several definitions of outliers. One of the more widely accepted comes from Barnett and Lewis, who define an outlier as “an observation (or subset of observations) which appears to be inconsistent with the remainder of that set of data”. However, identifying outliers in a data set is far from clear-cut, because suspicious observations may be low-probability values from the same distribution or perfectly valid extreme values (tails), for example.

One way to minimize the effect of outliers is to use robust statistics, which avoid the dilemma of whether to remove or modify observations that appear suspicious. When robust statistics are not practical for the problem in question, it is important to investigate and record the causes of the possible outliers, removing only the data points clearly identified as outliers.
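As a rough illustration of why robust statistics reduce the influence of a suspicious observation, the sketch below compares the mean and standard deviation with the median and the median absolute deviation (MAD) on a small series. It is written in Python with only the standard library and is not taken from the submitted MATLAB code. The data values and the helper name `mad` are made up for the example.

```python
import statistics


def mad(values):
    """Unscaled median absolute deviation from the median."""
    center = statistics.median(values)
    return statistics.median([abs(v - center) for v in values])


# Hypothetical data: six well-behaved readings, then one suspicious value.
clean = [9.8, 10.1, 10.0, 9.9, 10.2, 10.0]
contaminated = clean + [55.0]

for label, series in (("clean", clean), ("contaminated", contaminated)):
    print(
        f"{label:>12}: "
        f"mean={statistics.mean(series):.2f} "
        f"stdev={statistics.stdev(series):.2f} "
        f"median={statistics.median(series):.2f} "
        f"MAD={mad(series):.2f}"
    )
```

In this example the single extreme value pulls the mean and standard deviation far from their original values, while the median and MAD stay essentially where they were.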

When the causes of the outliers are only partially identified, sound judgment and a realistic assessment of the practical implications of retaining them are required. Because their causes are not clearly determined, such observations should still be used in the data analysis. Depending on time and computing power constraints, it is often possible to assess the impact of the outliers informally by carrying out the analysis with and without the suspicious observations.
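A minimal sketch of such an informal "with and without" assessment follows. It is Python using only the standard library, not part of the submitted MATLAB code. The function names, the choice of mean and standard deviation as the "analysis", and the data are all assumptions made for illustration; in practice the analysis would be whatever model or statistic is actually of interest.

```python
import statistics


def summarize(values):
    """Stand-in for the real analysis: a few summary statistics."""
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values),
    }


def compare_with_and_without(values, suspect_indices):
    """Run the analysis on all data and on the data minus the suspects."""
    suspects = set(suspect_indices)
    kept = [v for i, v in enumerate(values) if i not in suspects]
    full = summarize(values)
    reduced = summarize(kept)
    # Change in each statistic caused by dropping the suspicious points.
    shift = {key: reduced[key] - full[key] for key in ("mean", "stdev")}
    return full, reduced, shift


# Hypothetical series in which the observation at index 4 looks suspicious.
series = [12.1, 11.8, 12.4, 12.0, 19.5, 12.2, 11.9]
full, reduced, shift = compare_with_and_without(series, suspect_indices=[4])

print("all data:        ", full)
print("without suspects:", reduced)
print("shift:           ", shift)
```

If the shift is negligible for the purpose at hand, retaining the suspicious observations costs little; if it is large, the conclusions depend on them and their causes deserve closer investigation.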

This document presents techniques for identifying suspicious observations that require further analysis, as well as tests for deciding whether particular observations are outliers. Nevertheless, it would be dangerous to accept the result of any test or technique blindly, without expert judgment, because real data may violate the assumptions underlying the methods.

The following tests have been implemented:

• Z-scores
• Modified Z-scores
• Boxplot
• Adjusted Boxplot
• Generalized ESD Procedure
• Grubbs test
• Exponential Smoothing
• Kimber test for exponential distribution
• Moving Window Filtering Algorithm
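
To give a feel for the simplest methods in this list, here is a hedged sketch of three of them (Z-scores, modified Z-scores and the boxplot rule) in Python with only the standard library. It is not the submitted MATLAB implementation and may differ from it in detail. The thresholds are common conventions assumed for the example: 3.0 for Z-scores, 3.5 with the constant 0.6745 for modified Z-scores (as proposed by Iglewicz and Hoaglin), and 1.5 times the interquartile range for the boxplot fences. The function names and data are hypothetical, and the code assumes the series has non-zero spread.

```python
import statistics


def z_score_flags(values, threshold=3.0):
    """Flag points whose |z| exceeds the threshold (mean and sample stdev)."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # assumed non-zero
    return [abs(v - mean) / sd > threshold for v in values]


def modified_z_score_flags(values, threshold=3.5):
    """Flag points using the median and MAD instead of mean and stdev."""
    med = statistics.median(values)
    mad = statistics.median([abs(v - med) for v in values])  # assumed non-zero
    return [abs(0.6745 * (v - med) / mad) > threshold for v in values]


def boxplot_flags(values, k=1.5):
    """Flag points outside the fences Q1 - k*IQR and Q3 + k*IQR."""
    # statistics.quantiles uses the "exclusive" method by default; other
    # quartile definitions give slightly different fences.
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v < lower or v > upper for v in values]


# Hypothetical series: nine similar readings and one suspicious value (14.0).
series = [10.1, 9.9, 10.0, 10.2, 9.8, 10.0, 10.1, 9.9, 14.0, 10.0]

for name, rule in (
    ("z-score", z_score_flags),
    ("modified z-score", modified_z_score_flags),
    ("boxplot", boxplot_flags),
):
    flagged = [v for v, is_out in zip(series, rule(series)) if is_out]
    print(f"{name:>17}: {flagged}")
```

On this series the plain Z-score rule flags nothing, while the other two rules flag 14.0. With the sample standard deviation, |z| cannot exceed (n - 1)/sqrt(n), which is about 2.85 for n = 10, so a threshold of 3 can never be reached in such a short series. This is one example of how a method's assumptions can fail on real data.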

Test files are also available to check that the program works on your platform.

I hope it will help.

Best wishes,

Francisco Alcaraz

Cite As

Francisco Augusto Alcaraz Garcia (2025). Tests to Identify Outliers in Data Series (https://www.mathworks.com/matlabcentral/fileexchange/28501-tests-to-identify-outliers-in-data-series), MATLAB Central File Exchange.

MATLAB Release Compatibility
Created with R2009a
Compatible with any release
Platform Compatibility
Windows macOS Linux

Version: 1.0.0.0