the average of multiple Probability density functions

Question

Rima Habib 2019년 12월 5일

0
링크

이 질문에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/494981-the-average-of-multiple-probability-density-functions

편집: Ridwan Alam 2019년 12월 6일

Temp.mat

Hallo,

I have a cell array of {1000,1} pds, they are all the same distribution, but they differ slightly in the parameter values

Does any one know how to get the average of these pds? I have got them in iterative process and would like now to know the mean or average of them

I though of taking the mean of the parameter values in the pds in for loop but I am truggling with the right way to do it because I keep getting errors.

it would be helpful if some one gives me a hint :)

I have attached the cell array (Temp) in mat file

Thanks in advanced!

댓글 수: 5
이전 댓글 3개 표시이전 댓글 3개 숨기기

Image Analyst 2019년 12월 5일

MATLAB Online에서 열기

J, it should work regardless of what toolboxes you have

 s=load('temp.mat')
 cellArray = s.Temp
s = 
  struct with fields:
    Temp: {1000×1 cell}
    
cellArray =
  1000×1 cell array
    {1×1 prob.GeneralizedParetoDistribution}
    {1×1 prob.GeneralizedParetoDistribution}
    {1×1 prob.GeneralizedParetoDistribution}
    {1×1 prob.GeneralizedParetoDistribution}
etc.

Jeff Miller 2019년 12월 5일

What do you mean by "the average of the probability distributions"?

For example, suppose one distribution is normal(0,1) and another distribution is normal(10,2). Is the average normal(5,1.5)? Or is the average bimodal with one mode around 0 and another around 10? Or is the average distribution something else?

댓글을 달려면 로그인하십시오.

이 질문에 답변하려면 로그인하십시오.

Answer 1

Ridwan Alam 2019년 12월 5일

1
링크

이 답변에 대한 바로 가기 링크

https://kr.mathworks.com/matlabcentral/answers/494981-the-average-of-multiple-probability-density-functions#answer_405038

The idea of "average" is ill-defined for pdfs.

In your case, all the 1000 distridutions are generalized paretos. Your theta is fixed in all pdfs, so you can certainly take the average of the sigma's and the k's. But what meaning/significance those "averages" carry, you need to come up with that.

If you are looking for the average expected value, you can calculate the mean (expected value) for each pareto and then average those. mean = theta + (sigma/(1-k)). ref: https://en.wikipedia.org/wiki/Generalized_Pareto_distribution

Also, if you are looking for the overall distribution parameters, you can combine the input data [available in Temp{}.InputData.data] and use gppdf().

Hope this helps.

댓글 수: 2
없음 표시없음 숨기기

Rima Habib 2019년 12월 6일

@Jiff Miller @Ridwan Alam

I am trying to generate artificial Time series, so after an iterative methode, taking the average of the resulted pdfs is the best way to get the gppdf that suits the data.

so the only idea I could came up with is to take the average of the sigmas and ks but I am also interessted in your other approch which is combinig the inputdata then using gppdf.

how to do that exactly with a cell array full of pdfs. I couldnt figure it out?

Thanks you for your Help!

Ridwan Alam 2019년 12월 6일

편집: Ridwan Alam 2019년 12월 6일

MATLAB Online에서 열기

Rima, as I mentioned one approach could be to stack all the inputdata from all the pdfs.

allInputData=[];
for i = 1:length(Temp)
    allInputData = [allInputData; Temp{i}.InputData.data];
end
% in your case, theta is fixed
newTheta = Temp{1}.theta;
paramhat = gpfit(allInputData-newTheta);
newK = paramhat(1);
newSigma = paramhat(2);

Hope this helps.

Please accept the response as an answer, and vote up if you liked the conversation.

댓글을 달려면 로그인하십시오.

the average of multiple Probability density functions

댓글 수: 5
이전 댓글 3개 표시이전 댓글 3개 숨기기

채택된 답변

댓글 수: 2
없음 표시없음 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

Community Treasure Hunt

the average of multiple Probability density functions

댓글 수: 5 이전 댓글 3개 표시이전 댓글 3개 숨기기

채택된 답변

댓글 수: 2 없음 표시없음 숨기기

추가 답변 (0개)

참고 항목

카테고리

태그

Community Treasure Hunt

댓글 수: 5
이전 댓글 3개 표시이전 댓글 3개 숨기기

댓글 수: 2
없음 표시없음 숨기기