Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Wallin, Gabriel and Wiberg, Marie (2023) Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores. Journal of Educational and Behavioral Statistics, 48 (5). pp. 603-635.

[thumbnail of wallin-wiberg-2023-model-misspecification-and-robustness-of-observed-score-test-equating-using-propensity-scores]
Text (wallin-wiberg-2023-model-misspecification-and-robustness-of-observed-score-test-equating-using-propensity-scores)
Available under License Creative Commons Attribution.

Download (0B)
[thumbnail of wallin-wiberg-2023-model-misspecification-and-robustness-of-observed-score-test-equating-using-propensity-scores]
Text (wallin-wiberg-2023-model-misspecification-and-robustness-of-observed-score-test-equating-using-propensity-scores)
Available under License Creative Commons Attribution.

Download (0B)
[thumbnail of wallin-wiberg-2023-model-misspecification-and-robustness-of-observed-score-test-equating-using-propensity-scores]
Text (wallin-wiberg-2023-model-misspecification-and-robustness-of-observed-score-test-equating-using-propensity-scores)
wallin-wiberg-2023-model-misspecification-and-robustness-of-observed-score-test-equating-using-propensity-scores.pdf - Published Version
Available under License Creative Commons Attribution.

Download (548kB)

Abstract

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the propensity score model. The study assumes a parametric form of the propensity score and evaluates the effects of various misspecification scenarios on equating error. The results, based on both simulated and real testing data, show that (1) omitting an important covariate leads to biased estimates of the equated scores, (2) misspecifying a nonlinear relationship between the covariates and test scores increases the equating standard error in the tails of the score distributions, and (3) the equating estimators are robust against omitting a second-order term as well as using an incorrect link function in the propensity score estimation model. The findings demonstrate that auxiliary information is beneficial for test score equating in complex settings. However, it also sheds light on the challenge of making fair comparisons between nonequivalent test groups in the absence of common items. The study identifies scenarios, where equating performance is acceptable and problematic, provides practical guidelines, and identifies areas for further investigation.

Item Type:
Journal Article
Journal or Publication Title:
Journal of Educational and Behavioral Statistics
Uncontrolled Keywords:
Research Output Funding/yes_externally_funded
Subjects:
?? yes - externally funded ??
ID Code:
217016
Deposited By:
Deposited On:
04 Apr 2024 14:00
Refereed?:
Yes
Published?:
Published
Last Modified:
04 Apr 2024 14:00