Latin Hypercube sampling from distrete, non-uniform distribution
이전 댓글 표시
I have a series of independent events that are assumed to indiviudally have poisson distributions.
I first calculate the number of events in an annual period by using the compound poisson first principles.
Then I need to sample from the data to see which events occur (the number of events ranges for any given year range from 0 to 5). USing brute force monte carlo, I get convergence in the tail characteristics but only after extreme numbers of sampling.
I would like to reduce the number of samples while preserving the representative distributions evn when sampling in years with mroe than 1 event. For loops do not preserve the full distributions outside of the loop.
I do not see any natural latin hyper cube functionality within the statsitics toolbox that uses discrete distributions and perserves distributions for multiple event years.
Thansk in advance.
답변 (2개)
Tom Lane
2012년 6월 2일
0 개 추천
I can't think of any good way to use the latin hypercube feature here. You can apply poissinv to a distribution inside the unit cube, but I suspect that will not help.
I don't completely understand your scenario. If your total event number is the sum of independent Poisson values, then the sum itself is also Poisson. Furthermore, I believe the individual values, conditional on the sum, have a multinomial distribution. Can you make use of that?
Tom Lane
2012년 6월 2일
This is probably not what you want, but if you explain further why it's not what you want, maybe we can come closer to a solution. To sample intervals of unequal length or probability:
edges = [0 .4 .5 .8 1]';
[~,b] = histc(rand(1000,1),edges);
or
v = mnrnd(1,diff(edges),1000);
b = v*(1:4)';
To sample within a specified range of probabilities:
d = diff(edges);
[b, edges(b)+d(b).*rand(size(b))]
댓글 수: 6
David
2012년 6월 4일
David
2012년 6월 4일
Tom Lane
2012년 6월 4일
If you want non-random sampling along a grid of probability values, this seems reasonable. One thing I note is that this implicitly treats the categories as ordered. A consequence is that a category with probability 0.15 might overlap one or two "tenth percentile" values depending on the ordering of it among the other categories. For instance, it might span the interval [.21,.36] (one percentile) or [.36,.51] (two percentiles). I'm not in a position to declare that this is good or bad for your case.
David
2012년 6월 4일
David
2012년 6월 4일
Tom Lane
2012년 6월 5일
Can you not just sample, with replacement, from a multinomial distribution based on the separate Poisson rates? That may give you multiple cases of the same discrete event from the original distribution. But if the count for that event follows a Poisson distribution itself, I would have thought that was okay.
카테고리
도움말 센터 및 File Exchange에서 Univariate Discrete Distributions에 대해 자세히 알아보기
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!