Model Data Fit - Basic Statistics for Behavioral Sciences - Lecture Notes, Study notes of Statistics for Psychologists

Model Data Fit, Assumption Checking, Multidimensional Models, Prediction Checking, Equal Discrimination Index Checking, Checking Ability Parameter, Goodness Of Fit are learning points of this lecture.

Typology: Study notes

2011/2012

Uploaded on 11/21/2012

ashakiran
ashakiran 🇮🇳

4.5

(27)

261 documents

1 / 2

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
26
Ch. 4: Model-Data Fit
I. Introduction
A. IRT has different models (1, 2, 3 plm, and uni- and multidimensional models).
B. If a model does not fit the data, IRT will lose its advantages over CTT.
C. Three methods of checking the model-data fit.
1. Assumption checking.
2. Invariance checking.
3. Prediction checking.
II. Assumption checking
A. Unidimensionality checking
1. Factor analysis for a one-factor solution on inter-item correlation matrix
(tetrachoric). Looking for one dominant eigenvalue.
2. Local independence can be checked by investigating the variance-
covariance matrix of items.
3. The b-value can be checked from two different ability groups. If the b-
values are in linear relationship between the two groups, then the
unidimensionality assumption is met.
B. Equal discrimination index checking (a)
1. If the equal discrimination index assumption is violated, the Rasch model
(1-plm) is not valid for the data.
2. The item-test correlation distribution can be used. If it is homogeneous,
the a-value is equal.
C. Minimal guessing checking (c-parameter)
1. If the c-parameter is not minimal, the 3-pl model is valid.
2. If the performance of the low-ability students on the most difficult items is
close to zero, the c-parameter is minimal.
D. Non-speeded test checking
1. The variance of number of omitted items and the variance of number of
incorrectly answered items can be checked. If the ratio of two variances
(O/I) is close to zero, the assumption of non-speeded test is met.
2. The test scores of examinees under the specific time limit and without a
time limit can be compared. If they are similar to each other, the
assumption is met.
3. The percentage of examinees completing the test, percentage of examinees
completing 75% of the test, and the number of items completed by 80%
ofthe examinees can be reviewed. If nearly all examinees complete nearly
all of the items, speed is not an important factor.
III. Invariance checking
A. Checking ability parameter ( )
1. Make two tests with different b-values for a unidimensional ability item
bank.
2. Administer the two tests to a group of examines.
Docsity.com
pf2

Partial preview of the text

Download Model Data Fit - Basic Statistics for Behavioral Sciences - Lecture Notes and more Study notes Statistics for Psychologists in PDF only on Docsity!

Ch. 4: Model-Data Fit I. Introduction A. IRT has different models (1, 2, 3 plm, and uni- and multidimensional models). B. If a model does not fit the data, IRT will lose its advantages over CTT. C. Three methods of checking the model-data fit.

  1. Assumption checking.
  2. Invariance checking.
  3. Prediction checking.

II. Assumption checking A. Unidimensionality checking

  1. Factor analysis for a one-factor solution on inter-item correlation matrix (tetrachoric).  Looking for one dominant eigenvalue.
  2. Local independence can be checked by investigating the variance- covariance matrix of items.
  3. The b-value can be checked from two different ability groups. If the b- values are in linear relationship between the two groups, then the unidimensionality assumption is met. B. Equal discrimination index checking (a)
  4. If the equal discrimination index assumption is violated, the Rasch model (1-plm) is not valid for the data.
  5. The item-test correlation distribution can be used. If it is homogeneous, the a-value is equal. C. Minimal guessing checking (c-parameter)
  6. If the c-parameter is not minimal, the 3-pl model is valid.
  7. If the performance of the low-ability students on the most difficult items is close to zero, the c-parameter is minimal. D. Non-speeded test checking
  8. The variance of number of omitted items and the variance of number of incorrectly answered items can be checked. If the ratio of two variances (O/I) is close to zero, the assumption of non-speeded test is met.
  9. The test scores of examinees under the specific time limit and without a time limit can be compared. If they are similar to each other, the assumption is met.
  10. The percentage of examinees completing the test, percentage of examinees completing 75% of the test, and the number of items completed by 80% ofthe examinees can be reviewed. If nearly all examinees complete nearly all of the items, speed is not an important factor.

III. Invariance checking A. Checking ability parameter ( )

  1. Make two tests with different b-values for a unidimensional ability item bank.
  2. Administer the two tests to a group of examines.

Docsity.com

  1. The

scores from two tests should make a linear regression with the slope of 1 and intercept of 0 within the measurement error range if

is invariant.

  1. Otherwise, some assumptions are violated. B. Checking item parameters
  2. Set two different ability groups with respect to a given trait.
  3. Administer a set of items to each of the two groups.
  4. Item parameter estimates from two groups should be similar to each other within the measurement error range if invariance of item parameter estimates is to be satisfied. C. Checking model predictions
  5. Divide a group of examinees into 10-15 subgroups according to their ability level (ability category). The ability categories should be wide enough to have sufficient sample size and small enough to have a homogeneous ability level.
  6. Compute eij = Pij – E (Pij) where eij = raw residual for item i and category j, Pij = observed proportion of correct responses on item i and category j, and E (Pij) = expected proportion of correct responses on item i and category j.
  7. Pij is the same p-value in CTT for each item and each ability category.
  8. E (Pij) can be either the probability of the mid-point of in the ability category or the mean probability of correct responses in the ability category.
  9. eij does not consider the sampling error associated with E (Pij).
  10. Standardized residual zij

zij =

j

ij ij

ij ij

N

E P E P

P E P

( )[ 1 ( )]

where Nj = number of examinees in the ability category.

  1. 2 -test can be done for a goodness-of-fit test. m

j

obs zij 1

(^22) , df = m – k

where m: number of ability categories, and k: number of parameters in the IRT model.

  1. If we reject Ho from the 2 -test, the model does not fit the data.
  2. Some graphical methods to test the goodness-of-fit (pp.63-73).

Docsity.com