COVID-19 on Chest Radiographs: A Multireader Evaluation of an Artificial Intelligence System

被引:146
作者
Murphy, Keelin [1 ]
Smits, Henk [2 ]
Knoops, Arnoud J. G. [3 ]
Korst, Michael B. J. M. [3 ]
Samson, Tijs [3 ]
Scholten, Ernst T. [1 ]
Schalekamp, Steven [1 ]
Schaefer-Prokop, Cornelia M. [1 ,4 ]
Philipsen, Rick H. H. M. [5 ]
Meijers, Annet [5 ]
Melendez, Jaime [5 ]
van Ginneken, Bram [1 ]
Rutten, Matthieu [3 ]
机构
[1] Radboud Univ Nijmegen, Diagnost Image Anal Grp, Med Ctr, Geert Grotepl 10, NL-6500 HB Nijmegen, Netherlands
[2] Bernhoven Hosp, Dept Radiol, Uden, Netherlands
[3] Jeroen Bosch Hosp, Dept Radiol, Shertogenbosch, Netherlands
[4] Meander Med Ctr, Dept Radiol, Amersfoort, Netherlands
[5] Thirona, Nijmegen, Netherlands
关键词
D O I
10.1148/radiol.2020201874
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background: Chest radiography may play an important role in triage for coronavirus disease 2019 (COVID-19), particularly in low-resource settings. Purpose: To evaluate the performance of an artificial intelligence (AI) system for detection of COVID-19 pneumonia on chest radiographs. Materials and Methods: An AI system (CAD4COVID-XRay) was trained on 24 678 chest radiographs, including 1540 used only for validation while training. The test set consisted of a set of continuously acquired chest radiographs (n = 454) obtained in patients suspected of having COVID-19 pneumonia between March 4 and April 6, 2020, at one center (223 patients with positive reverse transcription polymerase chain reaction [RT-PCR] results, 231 with negative RT-PCR results). Radiographs were independently analyzed by six readers and by the AI system. Diagnostic performance was analyzed with the receiver operating characteristic curve. Results: For the test set, the mean age of patients was 67 years +/- 14.4 (standard deviation) (56% male). With RT-PCR test results as the reference standard, the AI system correctly classified chest radiographs as COVID-19 pneumonia with an area under the receiver-operating characteristic curve of 0.81. The system significantly outperformed each reader (P<.001 using the McNemar test) at their highest possible sensitivities. At their lowest sensitivities, only one reader significantly outperformed the AI system (P =.04). Conclusion: The performance of an artificial intelligence system in the detection of coronavirus disease 2019 on chest radiographs was comparable with that of six independent readers. (C) RSNA, 2020
引用
收藏
页码:E166 / E172
页数:7
相关论文
共 20 条
[1]  
Annarumma M, 2019, RADIOLOGY, V291, P195, DOI 10.1148/radiol.2018180921
[2]  
[Anonymous], RSNA PNEUM DET CHALL
[3]  
[Anonymous], RADIOLOGY
[4]  
Cheng MP, ANN INTERN MED
[5]  
Efron B., 1981, The Canadian Journal of Statistics, V9, P139, DOI DOI 10.2307/3314608
[6]  
Eng J, RADIOLOGY
[7]   Development and Validation of a Deep Learning-Based Automated Detection Algorithm for Major Thoracic Diseases on Chest Radiographs (vol 2, e191095, 2019) [J].
Hwang, E. J. ;
Park, S. ;
Jin, K-N .
JAMA NETWORK OPEN, 2019, 2 (04)
[8]   Deep Learning for Chest Radiograph Diagnosis in the Emergency Department [J].
Hwang, Eui Jin ;
Nam, Ju Gang ;
Lim, Woo Hyeon ;
Park, Sae Jin ;
Jeong, Yun Soo ;
Kang, Ji Hee ;
Hong, Eun Kyoung ;
Kim, Taek Min ;
Goo, Jin Mo ;
Park, Sunggyun ;
Kim, Ki Hwan ;
Park, Chang Min .
RADIOLOGY, 2019, 293 (03) :573-580
[9]   Portable chest X-ray in coronavirus disease-19 (COVID-19): A pictorial review [J].
Jacobi, Adam ;
Chung, Michael ;
Bernheim, Adam ;
Eber, Corey .
CLINICAL IMAGING, 2020, 64 :35-42
[10]  
Mollura D.J., 2019, Radiology in global health: Strategies, implementation, and applications