Rolling Beta For Multiple x and y variables simultaneously

조회 수: 4 (최근 30일)
Dwight Schrute III
Dwight Schrute III 2019년 9월 4일
댓글: Dwight Schrute III 2019년 9월 9일
Hello,
I have two matrices: One is an x-variable matrix (for example, 8 x variables as columns, with 1,000 rows, where each row represents a day). And one is a y-variable matrix (for example, 20 y variables as columns, with the same 1,000 rows).
I would like to calculate a matrix C that produces a rolling 100-day beta of each y variable to each x variable. Thus, C would have 20 * 8 = 160 columns. And moreover, since it's a rolling beta, the number of rows would be (1,000-100+1) = 901 rows (since the first 99 days wouldn't be eligible for a 100-day beta).
I have been playing around with various functions, e.g., corr, polyfit, and regress. However, none of these appear to address my query on rolling betas. In fact, I'm not sure I even see the ability to implement a rolling beta for just one variable in each matrix.
I would appreciate any guidance on this. Thank you!
  댓글 수: 10
the cyclist
the cyclist 2019년 9월 4일
Another clarification ...
As you have pointed out, the number of coefficients you'll be calculating is
(number y's) * (number x's) * (number windows) = 20 * 8 * 901.
How do you want those arranged in the output? In a 20 x 8 x 901 numeric array? Or something else?
Dwight Schrute III
Dwight Schrute III 2019년 9월 4일
Yes, a 20 x 8 x 901 matrix would be ideal, though it doesn't really matter much as long as we know what the dimensions represent. I can always use reshape to convert it to a format that would be desirable.

댓글을 달려면 로그인하십시오.

채택된 답변

the cyclist
the cyclist 2019년 9월 5일
편집: the cyclist 2019년 9월 6일
I believe this does what you intend.
% Set seed for reproducibility
rng default
% Set a few convenience parameters
N = 1000;
WINSIZE = 100;
XN = 8;
YN = 10;
% Simulate some data
x = randn(N,XN);
y = randn(N,YN);
% Calculate the number of window
numberWindows = N - WINSIZE + 1;
% Preallocate the output
output = zeros(YN,XN,numberWindows);
% Loop over the windows
for nw = 1:numberWindows
% Find the data for this window
thisWindowIndex = nw:(nw+WINSIZE-1);
thisWindowXData = x(thisWindowIndex,:);
thisWindowYData = y(thisWindowIndex,:);
for ny = 1:YN
for nx = 1:XN
% Solve the regression (returns intercept and slope)
tmp = [ones(WINSIZE,1) thisWindowXData(:,nx)]\thisWindowYData(:,ny);
% Store the slope
output(ny,nx,nw) = tmp(2);
end
end
end
  댓글 수: 1
Dwight Schrute III
Dwight Schrute III 2019년 9월 9일
This is fantastic. Simple and effective. I also learned something new, i.e., that the backwards slash (mldivide) can naturally be used for a conventional beta calculation.
Separately, I thought I'd also provide some code for a function I created. It generates a rolling beta for a given set of x and y variables (in the form of a matrix). This function uses the built-in "mov" functions, so the general methodology and output format follow their protocol. What's neat about this function is that no loops are involved. (Also, "n" in this case refers to the length of the rolling observation window.)
function bet = movbeta(y,x,n)
bet = (movsum(x.*y,[n-1 0],'e','d')+movmean(x,[n-1 0],'e','d').*movmean(y,[n-1 0],'e','d')*-n)./movstd(x,[n-1 0],'e','d').^2/(n-1);
end

댓글을 달려면 로그인하십시오.

추가 답변 (1개)

John D'Errico
John D'Errico 2019년 9월 4일
편집: John D'Errico 2019년 9월 4일
Is the x vector equally spaced? If so, then my movingslope code (found on the File Exchange) will do it trivially and efficiently.
If not, then nothing stops you from using a loop and polyfit. It still will be reasonably efficient. You could make it a little faster with carefully written code than polyfit, but why bother?
  댓글 수: 3
John D'Errico
John D'Errico 2019년 9월 4일
If the points are not evenly spaced, then the regression matrix changes for each location. You could write code that would work, not using a loop. It would look more elegant. It might take more memory though.
For example, you could write it using an update and downdate for a QR decomposition. Adding one point at the end, then dropping the first point. It would still be a loop. And the update/downdate would be slower then just throwing backslash at it, or even polyfit.
Or, given a simple regression for just a simple slope, you could do effectively the same thing. The formula for the slope is easy to write down. So, again, it would be easy to do, though still a loop.
Is this something you will be doing often? If so, then it would be worth the programmer time to do it better. But for a one shot deal, I'd not bother. CPU time is really cheap, and for a problem that is not a bottleneck in your task, a loop is easy.
Dwight Schrute III
Dwight Schrute III 2019년 9월 9일
It is indeed something I'd be doing often. But your point is a good one, i.e., that a loop might be sufficient for this purpose.
As I mentioned to the cyclist, I also created a function that generates a rolling beta for a given set of x and y variables (in the form of a matrix). This function uses the built-in "mov" functions, so the general methodology and output format follow their protocol. What's neat about this function is that no loops are involved. (Also, "n" in this case refers to the length of the rolling observation window.)
function bet = movbeta(y,x,n)
bet = (movsum(x.*y,[n-1 0],'e','d')+movmean(x,[n-1 0],'e','d').*movmean(y,[n-1 0],'e','d')*-n)./movstd(x,[n-1 0],'e','d').^2/(n-1);
end

댓글을 달려면 로그인하십시오.

카테고리

Help CenterFile Exchange에서 Model Building and Assessment에 대해 자세히 알아보기

태그

제품


릴리스

R2019a

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by