Subgroup identification from randomized clinical trial data

被引:401
作者
Foster, Jared C. [1 ]
Taylor, Jeremy M. G. [1 ]
Ruberg, Stephen J. [2 ]
机构
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[2] Eli Lilly, Global Stat Sci, Adv Analyt, Indianapolis, IN USA
基金
美国国家卫生研究院;
关键词
randomized clinical trials; subgroups; random forests; regression trees; tailored therapeutics;
D O I
10.1002/sim.4322
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We consider the problem of identifying a subgroup of patients who may have an enhanced treatment effect in a randomized clinical trial, and it is desirable that the subgroup be defined by a limited number of covariates. For this problem, the development of a standard, pre-determined strategy may help to avoid the well-known dangers of subgroup analysis. We present a method developed to find subgroups of enhanced treatment effect. This method, referred to as 'Virtual Twins', involves predicting response probabilities for treatment and control 'twins' for each subject. The difference in these probabilities is then used as the outcome in a classification or regression tree, which can potentially include any set of the covariates. We define a measure Q((A) over cap) to be the difference between the treatment effect in estimated subgroup ((A) over cap) and the marginal treatment effect. We present several methods developed to obtain an estimate of Q((A) over cap), including estimation of Q((A) over cap) using estimated probabilities in the original data, using estimated probabilities in newly simulated data, two cross-validation-based approaches, and a bootstrap-based bias-corrected approach. Results of a simulation study indicate that the Virtual Twins method noticeably outperforms logistic regression with forward selection when a true subgroup of enhanced treatment effect exists. Generally, large sample sizes or strong enhanced treatment effects are needed for subgroup estimation. As an illustration, we apply the proposed methods to data from a randomized clinical trial. Copyright (C) 2011 John Wiley & Sons, Ltd.
引用
收藏
页码:2867 / 2880
页数:14
相关论文
共 21 条
[1]   Subgroup analysis and other (mis)uses of baseline data in clinical trials [J].
Assmann, SF ;
Pocock, SJ ;
Enos, LE ;
Kasten, LE .
LANCET, 2000, 355 (9209) :1064-1069
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]  
Brookes S T, 2001, Health Technol Assess, V5, P1
[4]   Subgroup analyses in randomized trials: risks of subgroup-specific analyses; power and sample size for the interaction test [J].
Brookes, ST ;
Whitely, E ;
Egger, M ;
Smith, GD ;
Mulheran, PA ;
Peters, TJ .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2004, 57 (03) :229-236
[5]  
Cui Lu, 2002, J Biopharm Stat, V12, P347, DOI 10.1081/BIP-120014565
[6]   BAYESIAN SUBSET ANALYSIS [J].
DIXON, DO ;
SIMON, R .
BIOMETRICS, 1991, 47 (03) :871-881
[7]   Commentary: The problem of cogent subgroups: A clinicostatistical tragedy [J].
Feinstein, AR .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1998, 51 (04) :297-299
[8]   The Cross-Validated Adaptive Signature Design [J].
Freidlin, Boris ;
Jiang, Wenyu ;
Simon, Richard .
CLINICAL CANCER RESEARCH, 2010, 16 (02) :691-698
[9]   TESTING FOR QUALITATIVE INTERACTIONS BETWEEN TREATMENT EFFECTS AND PATIENT SUBSETS [J].
GAIL, M ;
SIMON, R .
BIOMETRICS, 1985, 41 (02) :361-372
[10]   Responder identification in clinical trials with censored data [J].
Kehl, V ;
Ulm, K .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2006, 50 (05) :1338-1355