Mvregress and choice of the estimation algorithm

조회 수: 5 (최근 30일)
Maria445
Maria445 2018년 9월 23일
답변: Ayush Aniket 2024년 12월 20일
I am performing a multivariate regression analysis using mvregress:
[beta,Sigma] = mvregress(X,Y,'algorithm','cwls');
However, I noticed that, although the estimates are identical, the standard errors completely change if I use the cwls algorithm as opposed to the mvn one. In particular, with cwls, almost all coefficient appear as significant, while, using mvn, almost none of them is.
There are no missing responses in my data, so the default would be mvn.
Can a simple algorithm choice be so influential on the result? Is there a way I can choose wisely between the two?

채택된 답변

Ayush Aniket
Ayush Aniket 2024년 12월 20일
The choice of algorithm in multivariate regression can indeed influence the results, particularly the estimation of standard errors becuase of their methods of computing as described below:
  • CWLS (Component-Wise Least Squares) approach estimates the regression coefficients for each response variable separately, which can lead to smaller standard errors if the responses are not highly correlated. It does not account for the covariance between response variables in the error terms, potentially underestimating standard errors if there is significant correlation.
  • MVN (Multivariate Normal): Assumes a multivariate normal distribution for the errors and estimates the coefficients considering the covariance structure between responses This method generally provides more robust standard error estimates when responses are correlated, as it accounts for the covariance.
Some basic suggestions for choosing the right algorithm:
  • Correlation Between Responses: If your response variables are correlated, mvn is usually more appropriate as it accounts for this correlation in the estimation process.
  • Model Assumptions: Consider the assumptions of each method. mvn assumes multivariate normality of errors, which might not hold in all cases.
  • Data Characteristics: If you have a large dataset with complex covariance structures, mvn might be better suited. For simpler cases or when computational efficiency is a concern, cwls might suffice.

추가 답변 (0개)

카테고리

Help CenterFile Exchange에서 Statistics and Machine Learning Toolbox에 대해 자세히 알아보기

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by