Defining the 95% of data which are around the mean value

Question

Giorgos Papakonstantinou 2013년 7월 31일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/83647-defining-the-95-of-data-which-are-around-the-mean-value

For a given set of data, how can I define which of those correspond to the 95% of the data which are around the mean value?

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Jan 2013년 8월 1일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/83647-defining-the-95-of-data-which-are-around-the-mean-value#answer_93314

편집: Jan 2013년 8월 1일

MATLAB Online에서 열기

x = rand(1, 1000) - 0.5;
m = mean(x);
dist = abs(x - m);
[sortDist, sortIndex] = sort(dist);
index_95perc = sortIndex(1:floor(0.95 * numel(x)));
x_95percent = x(index_95perc);

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

Giorgos Papakonstantinou 2013년 8월 1일

MATLAB Online에서 열기

Thank you Jan. It was easier than I expected. Before your answer I was doing the folllowing:

vals=abs(slope);
[CdfY,CdfX] = ecdf(vals,'Function','cdf');  % compute empirical function
cr=CdfY<0.95;

where vals is my dataset.

댓글을 달려면 로그인하십시오.

Answer 2

Image Analyst 2013년 7월 31일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/83647-defining-the-95-of-data-which-are-around-the-mean-value#answer_93230

I'd sort the data using sort(). Then use cumsum() to get the cdf. Normalize the CDF then go from the 2.5% element to the 97.5% element using find() to find the elements (values) where the data starts and stops. It's pretty easy, but let me know if you can't figure it out.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

Answer 3

Giorgos Papakonstantinou 2013년 7월 31일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/83647-defining-the-95-of-data-which-are-around-the-mean-value#answer_93253

Thank you for your answer Image Analyst. The data contain also negative values. I am not sure but I think that poses a problem when I normalize the data after the cumsum.

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

Tom Lane 2013년 8월 1일

It sounds like Image Analyst is talking about the cumsum of a vector that assigns probability 1/N to each of N points. However, you could take the 0.025*N and 0.975*N values from the sorted vector directly, converting the index to an integer as you see fit.

댓글을 달려면 로그인하십시오.

Defining the 95% of data which are around the mean value

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (2개)

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

Defining the 95% of data which are around the mean value

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

채택된 답변

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

추가 답변 (2개)

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 1 이전 댓글 -1개 표시이전 댓글 -1개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 1
이전 댓글 -1개 표시이전 댓글 -1개 숨기기