Development and Validation of a Deep Learning-Based Automated Detection Algorithm for Major Thoracic Diseases on Chest Radiographs

被引:296
作者
Hwang, Eui Jin [1 ]
Park, Sunggyun [2 ]
Jin, Kwang-Nam [3 ]
Kim, Jung Im [4 ]
Choi, So Young [5 ]
Lee, Jong Hyuk [1 ]
Goo, Jin Mo [1 ]
Aum, Jaehong [2 ]
Yim, Jae-Joon [6 ]
Cohen, Julien G. [7 ]
Ferretti, Gilbert R. [7 ]
Park, Chang Min [1 ]
Kim, Dong Hyeon [8 ]
Woo, Sungmin [9 ]
Choi, Wonseok [8 ]
Hwang, In Pyung [8 ]
Song, Yong Sub [8 ]
Lim, Jiyeon [8 ]
Kim, Hyungjin [8 ]
Wi, Jae Yeon [10 ]
Oh, Su Suk [11 ]
Kang, Mi-Jin [12 ]
Lee, Nyoung Keun [13 ]
Yoo, Jin Young [14 ]
Suh, Young Joo [15 ]
机构
[1] Seoul Natl Univ, Coll Med, Dept Radiol, 101 Daehak Ro, Seoul 03080, South Korea
[2] Lunit Inc, Seoul, South Korea
[3] Seoul Natl Univ, Boramae Med Ctr, Dept Radiol, Seoul, South Korea
[4] Kyung Hee Univ, Coll Med, Kyung Hee Univ Hosp Gangdong, Dept Radiol, Seoul, South Korea
[5] Eulji Univ, Med Ctr, Coll Med, Dept Radiol, Seoul, South Korea
[6] Seoul Natl Univ, Coll Med, Dept Internal Med, Div Pulm & Crit Care Med, Seoul, South Korea
[7] CHU Grenoble, Pole Imagerie, La Tronche, France
[8] Seoul Natl Univ Hosp, Coll Med, Seoul, South Korea
[9] Armed Forces Daejon Hosp, Daejon, South Korea
[10] Asan Med Ctr, Seoul, South Korea
[11] Seoul Natl Univ Hosp, Seoul, South Korea
[12] Inje Univ, Sanggyepaik Hosp, Seoul, South Korea
[13] Sungmin Hosp, Incheon, South Korea
[14] Chungbuk Natl Univ Hosp, Cheongju, South Korea
[15] Yonsei Univ, Coll Med, Seoul, South Korea
关键词
D O I
10.1001/jamanetworkopen.2019.1095
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
IMPORTANCE Interpretation of chest radiographs is a challenging task prone to errors, requiring expert readers. An automated system that can accurately classify chest radiographs may help streamline the clinical workflow. OBJECTIVES To develop a deep learning-based algorithm that can classify normal and abnormal results from chest radiographs with major thoracic diseases including pulmonary malignant neoplasm, active tuberculosis, pneumonia, and pneumothorax and to validate the algorithm's performance using independent data sets. DESIGN, SETTING, AND PARTICIPANTS This diagnostic study developed a deep learning-based algorithm using single-center data collected between November 1, 2016, and January 31, 2017. The algorithm was externally validated with multicenter data collected between May 1 and July 31, 2018. A total of 54 221 chest radiographs with normal findings from 47 917 individuals (21 556 men and 26 361 women; mean [SD] age, 51 [16] years) and 35 613 chest radiographs with abnormal findings from 14 102 individuals (8373 men and 5729 women; mean [SD] age, 62 [15] years) were used to develop the algorithm. A total of 486 chest radiographs with normal results and 529 with abnormal results (1 from each participant; 628 men and 387 women; mean [SD] age, 53 [18] years) from 5 institutions were used for external validation. Fifteen physicians, including nonradiology physicians, board-certified radiologists, and thoracic radiologists, participated in observer performance testing. Data were analyzed in August 2018. EXPOSURES Deep learning-based algorithm. MAIN OUTCOMES AND MEASURES Image-wise classification performances measured by area under the receiver operating characteristic curve; lesion-wise localization performances measured by area under the alternative free-response receiver operating characteristic curve. RESULTS The algorithm demonstrated a median (range) area under the curve of 0.979 (0.973-1.000) for image-wise classification and 0.972 (0.923-0.985) for lesion-wise localization; the algorithm demonstrated significantly higher performance than all 3 physician groups in both image-wise classification (0.983 vs 0.814-0.932; all P < .005) and lesion-wise localization (0.985 vs 0.781-0.907; all P < .001). Significant improvements in both image-wise classification (0.814-0.932 to 0.904-0.958; all P < .005) and lesion-wise localization (0.781-0.907 to 0.873-0.938; all P < .001) were observed in all 3 physician groups with assistance of the algorithm. CONCLUSIONS AND RELEVANCE The algorithm consistently outperformed physicians, including thoracic radiologists, in the discrimination of chest radiographs with major thoracic diseases, demonstrating its potential to improve the quality and efficiency of clinical practice.
引用
收藏
页数:13
相关论文
共 38 条
[1]  
[Anonymous], R LANG ENV STAT COMP
[2]   Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer [J].
Bejnordi, Babak Ehteshami ;
Veta, Mitko ;
van Diest, Paul Johannes ;
van Ginneken, Bram ;
Karssemeijer, Nico ;
Litjens, Geert ;
van der Laak, Jeroen A. W. M. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2017, 318 (22) :2199-2210
[3]  
Chakraborty D.P., ANAL DATA ACQUIRED U
[4]  
Coche E.E., 2011, Comparative Interpretation of CT and Standard Radiography of the Chest, V1A, P480, DOI DOI 10.1007/978-3-540-79942-9
[5]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[6]   Common patterns in 558 diagnostic radiology errors [J].
Donald, Jennifer J. ;
Barnard, Stuart A. .
JOURNAL OF MEDICAL IMAGING AND RADIATION ONCOLOGY, 2012, 56 (02) :173-178
[7]   Dermatologist-level classification of skin cancer with deep neural networks [J].
Esteva, Andre ;
Kuprel, Brett ;
Novoa, Roberto A. ;
Ko, Justin ;
Swetter, Susan M. ;
Blau, Helen M. ;
Thrun, Sebastian .
NATURE, 2017, 542 (7639) :115-+
[8]  
Forum of International Respiratory Societies, 2017, FORUM INT RESP SOC, V2nd
[9]   Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs [J].
Gulshan, Varun ;
Peng, Lily ;
Coram, Marc ;
Stumpe, Martin C. ;
Wu, Derek ;
Narayanaswamy, Arunachalam ;
Venugopalan, Subhashini ;
Widner, Kasumi ;
Madams, Tom ;
Cuadros, Jorge ;
Kim, Ramasamy ;
Raman, Rajiv ;
Nelson, Philip C. ;
Mega, Jessica L. ;
Webster, R. .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2016, 316 (22) :2402-2410
[10]  
HOLM S, 1979, SCAND J STAT, V6, P65