Dears Friends,
How can I clean this data?
Anyone have some a sugestion for me?

댓글 수: 2

Mohammad Sami
Mohammad Sami 2020년 3월 26일
If you are running R2019b and later, try the interactive cleaning task in the live editor.
Camilo Costa
Camilo Costa 2020년 3월 26일
Unfortunately I don´t have :/

댓글을 달려면 로그인하십시오.

답변 (3개)

Peng Li
Peng Li 2020년 3월 26일

0 개 추천

Technically, this is not a programming issue. rather, this is an issue about algorithm. It's all depending on what you meant by cleaning. Do you think the spikes are what you want to filtered out? Or do you want to do something else? If the spikes are what you think that shouldn't be filtered out, the simplest way to clean this is by a so called three sigma criterion--anything that is beyong mean+/-3*standard deviation is believed to be outliers. There are other tricks too. So, again, this is about the algorithm not about programming I believe.

댓글 수: 1

Camilo Costa
Camilo Costa 2020년 3월 26일
You are correct. This is about algorithm. If you have some advices, like functions to clean the outliers, smooth some high frequencies, etc., I will aprecciate. Thx

댓글을 달려면 로그인하십시오.

Peng Li
Peng Li 2020년 3월 26일

0 개 추천

A simple work around:
b = DADOSUFCS2(:, 2);
bstd = movstd(b, 100);
thre = nanmean(bstd);
bnew = b(bstd <= thre);

댓글 수: 3

Camilo Costa
Camilo Costa 2020년 3월 26일
편집: Camilo Costa 2020년 3월 26일
Furthermore, data remains that are not real as a result of measurement errors. How do I filter and smooth this data?
Peng Li
Peng Li 2020년 3월 26일
Sorry it's difficult for me to understand what you are trying to ask. What I provided is a simple algorithm based on moving standard deviation. anything whose corresponding moving standard deviation is above a threshold will be treated as outliers in my example.
Peng Li
Peng Li 2020년 3월 26일
How do you know that they are not real? Do you have a specific criterion? If you have, then it is simple. If you don't, you may need to work out a bit more algorithm side as no algorithm is the best for filtering a general data set. You are the best person who knows your data the best.

댓글을 달려면 로그인하십시오.

Camilo Costa
Camilo Costa 2020년 3월 27일

0 개 추천

Anyone already used the interactive cleaning task?

카테고리

도움말 센터File Exchange에서 Statistics and Linear Algebra에 대해 자세히 알아보기

질문:

2020년 3월 26일

답변:

2020년 3월 27일

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by