Testing goodness of fit: P-value

Question

Yasamin H. T. 2015년 6월 12일

1
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/223503-testing-goodness-of-fit-p-value

답변: Aditya 2024년 1월 31일

There is a continuous data-set, that I'm trying to test the goodness of its fit with chi-square.

I use [h,p,stats] = chi2gof(x,'CDF',pd,'NBins',nb), to test the null hypothesis and goodness of fit.

While the result shows h=0 ( NH is not rejected), p-value shows up as NaN. I even tried to change the number of bins, but p-value is still NaN. Any ideas why?

Thanks!

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Aditya 2024년 1월 31일

0
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/223503-testing-goodness-of-fit-p-value#answer_1400271

When you get a NaN (Not a Number) result for the p-value from the chi2gof function in MATLAB, it typically indicates that there is an issue with the calculation of the p-value. This can happen for several reasons:

All Observed Frequencies Match Expected Frequencies: If your observed frequencies match the expected frequencies exactly (or very closely), the chi-square statistic can be zero or extremely small, leading to numerical difficulties in calculating the p-value.
Insufficient Data: If there are too few data points or if the number of bins (NBins) is too large for the amount of data you have, some bins may end up with expected frequencies that are too low. The chi-square goodness-of-fit test generally requires at least 5 expected occurrences per bin.
Inappropriate Distribution: If the probability distribution object (pd) does not fit the data well or if it's not defined properly, the calculation of expected frequencies might not be valid, resulting in a NaN p-value.

Here are some steps you can take to troubleshoot the NaN p-value:

Check Expected Frequencies: Look at the stats output from chi2gof, which contains the observed and expected frequencies. Ensure that the expected frequencies are all greater than 5 to satisfy the assumptions of the chi-square test.
Adjust the Number of Bins: Try adjusting the number of bins (NBins) to ensure that you have a sufficient number of observations in each bin. You can start with a smaller number of bins and increase it gradually.
Verify the Distribution Fit: Ensure that the probability distribution (pd) you are using is appropriate for your data. You can plot your data against the probability distribution function (PDF) or cumulative distribution function (CDF) of pd to visually inspect the fit.

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글을 달려면 로그인하십시오.

Testing goodness of fit: P-value

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

Testing goodness of fit: P-value

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

답변 (1개)

댓글 수: 0 이전 댓글 -2개 표시이전 댓글 -2개 숨기기

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기

댓글 수: 0
이전 댓글 -2개 표시이전 댓글 -2개 숨기기