QA: Why are the coefficient estimates produced by rxLogit sometimes different than those produced by R’s glm() or SAS’s PROC LOGISTIC?

If rxLogit() converges within the maximum number of allowed iterations, and if the same is true of the other program(s), the most likely reason for a difference in estimated coefficients is that different “contrasts” are being used. Another possible reason is that if the model is singular (rank deficient) and variables are dropped to remove the singularities, the values of the remaining coefficients will depend upon which variables are dropped, and this may differ among the programs (this can be viewed as a type of contrast). 

If any of the programs fail to converge, then their estimates are unreliable and cannot be compared. Use of different initial values may lead to convergence. For instance, set the initialValues argument equal to 0 to uses all 0’s as starting values. Or set a vector with the correct number of elements. 

rxLogit displays the condition number of the final variance-covariance matrix as an indicator of the numerical reliability of the results. It also will terminate estimation if the model is too ill-conditioned to yield reliable estimates.
Note This is a "FAST PUBLISH" article created directly from within the Microsoft support organization. The information contained herein is provided as-is in response to emerging issues. As a result of the speed in making it available, the materials may include typographical errors and may be revised at any time without notice. See Terms of Use for other considerations.

Article ID: 3104288 - Last Review: 10/29/2015 09:09:00 - Revision: 1.0

Revolution Analytics

  • KB3104288