A proposed design and analysis for comparing digital and analog mammography: Special receiver operating characteristic methods for cancer screening

被引：39

作者：

Baker, SG ^{[1
]}

Pinsky, PF

机构：

[1] NCI, Biometry Res Grp, Bethesda, MD 20892 USA

[2] NCI, Early Detect Res Grp, Div Canc Prevent, Bethesda, MD 20892 USA

来源：

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION | 2001年 / 96卷 / 454期

关键词：

cancer screening; diagnostic testing; double sampling; permutation test; sample size; verification bias; sensitivity; specificity;

D O I：

10.1198/016214501753168136

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

Because randomized trials have shown a reduction in breast cancer mortality, analog mammography for the early detection of breast cancer has gained widespread use. Recently, several manufacturers have developed digital mammography, which promises great advantages in the storage and transmission of images. We were asked to design a study to compare the two types of mammography in terms of their performance for the early detection of breast cancer. A standard measure of mammography performance is the receiver operating characteristic (ROC) curve, which is a plot of false- and true-positive rates for each ordered classification of the mammography images. Methods for study design and data analysis based on ROC curves have been well developed for diagnostic tests, particularly in radiology. But for comparing the performance of mammography for the early detection of breast cancer among asymptomatic women, special considerations motivate new designs and methodology. First, digital mammography may cost substantially more than analog mammography. If this is the case, then the standard paired design, in which each subject undergoes both types of mammography, may be more expensive than necessary. To reduce costs, we propose a partial testing design, in which all subjects undergo analog mammography and those recommended for biopsy and a random sample not recommended for biopsy also undergo digital mammography. Second, the false-positive rate for analog mammography, defined as the rate of unnecessary biopsy, is near 1%. A standard ROC analysis that compares areas under the entire ROC curve would summarize performance over false-positive rates that are not relevant for evaluating the performance of cancer screening. As a more appropriate alternative, we propose basing inference on the-areas under the small parr of the ROC curves near the false-positive rates corresponding to a biopsy recommendation. Third, the vast majority of screened subjects are not biopsied, and so have an unknown cancer state at the time of screening. To make inference about the performance of a cancer screening test, the standard approach is to follow subjects not biopsied for some period, usually 1 year, and assume that those who developed cancer were missed on screening and those who did not develop cancer were cancer-free at screening. Unfortunately, this follow-up period can greatly lengthen the duration of the study. To compare the performance of digital and analog mammography without the need for a follow-up period, we propose estimating the ratio of areas under the ROC curves near the small false-positive rates associated with a biopsy recommendation. To compute sample sizes, our null hypothesis is that the ratio of partial ROC areas is 1, and our two possible alternative hypotheses are ratios of 1.6 and 2, both indicating superior performance for digital mammography. We assume a breast cancer prevalence of .003 and specify various parameters for the shapes of the ROC curves and their dependence. For a two-sided type I error of .05 and a power of .9, a standard paired design would require that 22,000 subjects undergo both analog and digital mammography. For the same type I error and power, the proposed partial testing design would require that 35,000 subjects undergo analog mammography and 10,000 subjects undergo both analog and digital mammography. Compared to the paired design, the reduction in the cost per subject is 23% if digital mammography costs four times as much as analog mammography and 41% if digital mammography costs 10 times as much as analog mammography.

引用

页码：421 / 428

页数：8

共 35 条

[1] ESTIMATION OF SOJOURN TIME DISTRIBUTIONS AND FALSE NEGATIVE RATES IN SCREENING PROGRAMS WHICH USE 2 MODALITIES [J].

ALEXANDER, FE .

STATISTICS IN MEDICINE, 1989, 8 (06) :743-755

[2]

American college of Radiology, 1995, BREAST IM REP DAT SY

[3] HAS THE USE OF CERVICAL, BREAST, AND COLORECTAL-CANCER SCREENING INCREASED IN THE UNITED-STATES [J].

ANDERSON, LM ;

MAY, DS .

AMERICAN JOURNAL OF PUBLIC HEALTH, 1995, 85 (06) :840-842

[4] EVALUATING SCREENING FOR THE EARLY DETECTION AND TREATMENT OF CANCER WITHOUT USING A RANDOMIZED CONTROL-GROUP [J].

BAKER, SG ;

CHU, KC .

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1990, 85 (410) :321-327

[5] Identifying combinations of cancer markers for further study as triggers of early intervention [J].

Baker, SG .

BIOMETRICS, 2000, 56 (04) :1082-1087

[6]

Baker SG, 1998, STAT MED, V17, P2219

[7] EVALUATING MULTIPLE DIAGNOSTIC-TESTS - WITH PARTIAL VERIFICATION [J].

BAKER, SG .

BIOMETRICS, 1995, 51 (01) :330-337

[8] THE MULTINOMIAL-POISSON TRANSFORMATION [J].

BAKER, SG .

STATISTICIAN, 1994, 43 (04) :495-504

[9] AREA ABOVE ORDINAL DOMINANCE GRAPH AND AREA BELOW RECEIVER OPERATING CHARACTERISTIC GRAPH [J].

BAMBER, D .

JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1975, 12 (04) :387-415

[10] PROFESSIONAL QUALITY ASSURANCE FOR MAMMOGRAPHY SCREENING PROGRAMS [J].

BIRD, RE .

RADIOLOGY, 1990, 177 (02) :587-587

← 1 2 3 4 →