TY - JOUR
T1 - Ascertaining properties of weighting in the estimation of optimal treatment regimes under monotone missingness
AU - Dong, Lin
AU - Laber, Eric
AU - Goldberg, Yair
AU - Song, Rui
AU - Yang, Shu
N1 - Funding Information: information Division of Mathematical Sciences, DMS-1555141; National Cancer Institute, P01 CA142538; National Institute of Diabetes and Digestive and Kidney Diseases, R01-DK-108073 Publisher Copyright: © 2020 John Wiley & Sons, Ltd.
PY - 2020/11/10
Y1 - 2020/11/10
N2 - Dynamic treatment regimes operationalize precision medicine as a sequence of decision rules, one per stage of clinical intervention, that map up-to-date patient information to a recommended intervention. An optimal treatment regime maximizes the mean utility when applied to the population of interest. Methods for estimating an optimal treatment regime assume the data to be fully observed, which rarely occurs in practice. A common approach is to first use multiple imputation and then pool the estimators across imputed datasets. However, this approach requires estimating the joint distribution of patient trajectories, which can be high-dimensional, especially when there are multiple stages of intervention. We examine the application of inverse probability weighted estimating equations as an alternative to multiple imputation in the context of monotonic missingness. This approach applies to a broad class of estimators of an optimal treatment regime including both Q-learning and a generalization of outcome weighted learning. We establish consistency under mild regularity conditions and demonstrate its advantages in finite samples using a series of simulation experiments and an application to a schizophrenia study.
AB - Dynamic treatment regimes operationalize precision medicine as a sequence of decision rules, one per stage of clinical intervention, that map up-to-date patient information to a recommended intervention. An optimal treatment regime maximizes the mean utility when applied to the population of interest. Methods for estimating an optimal treatment regime assume the data to be fully observed, which rarely occurs in practice. A common approach is to first use multiple imputation and then pool the estimators across imputed datasets. However, this approach requires estimating the joint distribution of patient trajectories, which can be high-dimensional, especially when there are multiple stages of intervention. We examine the application of inverse probability weighted estimating equations as an alternative to multiple imputation in the context of monotonic missingness. This approach applies to a broad class of estimators of an optimal treatment regime including both Q-learning and a generalization of outcome weighted learning. We establish consistency under mild regularity conditions and demonstrate its advantages in finite samples using a series of simulation experiments and an application to a schizophrenia study.
KW - Q-learning
KW - augmented inverse probability weighting
KW - dynamic treatment regimes
KW - monotonic coarseness
KW - outcome weighted learning
UR - http://www.scopus.com/inward/record.url?scp=85088806528&partnerID=8YFLogxK
U2 - https://doi.org/10.1002/sim.8678
DO - https://doi.org/10.1002/sim.8678
M3 - Article
C2 - 32729973
SN - 0277-6715
VL - 39
SP - 3503
EP - 3520
JO - Statistics in Medicine
JF - Statistics in Medicine
IS - 25
ER -