Lee, Kim and Mitra, Robin and Biedermann, Stefanie (2018) Optimal design when outcome values are not missing at random. Statistica Sinica, 28 (4). pp. 1821-1838. ISSN 1017-0405
SS_2016_0526_Preprint.pdf - Accepted Version
Available under License Creative Commons Attribution-NonCommercial.
Download (2MB)
A28n48.pdf - Published Version
Available under License Creative Commons Attribution-NonCommercial.
Download (424kB)
Abstract
The presence of missing values complicates statistical analyses. In design of experiments, missing values are particularly problematic when constructing optimal designs, as it is not known which values are missing at the design stage. When data are missing at random it is possible to incorporate this information into the optimality criterion that is used to find designs; Imhof, Song and Wong (2002) develop such a framework. However, when data are not missing at random this framework can lead to inefficient designs. We investigate and address the specific challenges that not missing at random values present when finding optimal designs for linear regression models. We show that the optimality criteria will depend on model parameters that traditionally do not affect the design, such as regression coefficients and the residual variance. We also develop a framework that improves efficiency of designs over those found assuming values are missing at random.