

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The indicator method used in statistical procedures such as genlog and glm to generate design matrices. The method assigns a unique column in the design matrix to each parameter, and the elements of the matrix depend on the factor-level combinations and covariates. The document also discusses redundancy and linear dependency of columns, and provides references to related literature.
Typology: Study notes
1 / 3
This page cannot be seen from the preview
Don't miss anything!


1
The indicator method is used in the GENLOG and the GLM procedures to generate the design matrix corresponding to the design specified. Under this method, each parameter (either non-redundant or redundant) in the model corresponds uniquely to a column in the design matrix. Therefore, the terms parameter and design matrix column are often used interchangeably without ambiguity.
The following notation is used throughout this appendix unless otherwise stated:
n Number of valid observations p Number of parameters X (^) n × p design matrix (also known as model matrix) x (^) ij Elements of X
The design matrix has as many rows as the number of valid observations. In the GLM procedure, an observation is a case in the data file. In the GENLOG procedure, an observation is a cell. In both procedures, the observations are uniquely identified by the factor-level combination. Therefore, rows of the design matrix are also uniquely identified by the factor-level combination.
The design matrix has as many columns as the number of parameters in the model. Columns of the design matrix are uniquely indexed by the parameters, which are in turn related to factor-level combinations.
2 Appendix 9
2
A factor-level combination is contained in another factor-level combination if the following conditions are true:
For example, the combination [A=1] is contained in [A=1][B=3] and so is the combination [B=3]. However, neither [A=3] nor [C=1] is contained in [A=1][B=3].
The design matrix X is generated by rows. Elements of the i th row are generated as follows:
A parameter is redundant if the corresponding column in the design matrix is linearly dependent on other columns. Linear dependent columns are detected using the SWEEP algorithm by Clarke (1982) and Ridout and Cobby (1989). Redundant parameters are permanently set to zero and their standard errors are set to system missing.
References
Clarke, M. R. B. 1982. Algorithm AS 178: The Gauss-Jordan sweep operator with detection of collinearity. Applied Statistics , Vol. 31, No. 2: 166–168.