Computer Vision / Video Analytics

Improving Breast Cancer Detection in Ultrasound Imaging Using AI

Jun 24, 2021

By Farah E. Shamout, Artie Shen, Jan Witowski, Jamie Oliver and Krzysztof Geras

Discuss (0)

AI-Generated Summary

Dislike

Researchers developed an AI system using a deep neural network that mimics radiologists' diagnostic procedures to improve breast cancer detection in ultrasound exams.
The AI system was trained on approximately four million ultrasound images on an HPC cluster powered by NVIDIA technologies and achieved higher diagnostic accuracy than individual radiologists.
A hybrid AI-radiologist model decreased false positive rates by 37.4% and reduced the number of requested biopsies by 27.8% while maintaining the same level of sensitivity as radiologists.

AI-generated content may summarize information incompletely. Verify important information. Learn more

Breast cancer is the most frequently diagnosed cancer among women worldwide. It’s also the leading cause of cancer-related deaths. Identifying breast cancer at an early stage before metastasis enables more effective treatments and therefore significantly improves survival rates.

Although mammography is the most widely used imaging technique for early detection of breast cancer, it is not always available in low-resource settings. Its sensitivity also drops for women with dense breast tissue.

Breast ultrasound is often used as a supplementary imaging modality to mammography in screening settings, and as the primary imaging modality in diagnostic settings. Despite its advantages, including lower costs relative to mammography, it is difficult to interpret breast ultrasound images as evident by the considerable intra-reader variability. This leads to increased false-positive findings, unnecessary biopsies, and significant discomfort to patients.

Previous work using deep learning for breast ultrasound has been based predominantly on small datasets on the scale of thousands of images. Many of these efforts also rely on expensive and time-consuming manual annotation of images to obtain image-level (presence of cancer in each image) or pixel-level (exact location of each lesion) labels.

Using AI to improve breast cancer detection

In our recent paper, Artificial Intelligence System Reduces False-Positive Findings in the Interpretation of Breast Ultrasound Exams, we leverage the full potential of deep learning and eliminate the need for manual annotations by designing a weakly supervised deep neural network whose working resembles the diagnostic procedure of radiologists (Figure 1).

Radiologist diagnostic procedure compared to AI

The following table compares how radiologists make predictions compared to our AI system.

RADIOLOGIST	AI NETWORK
Looks for abnormal findings in each image within a breast ultrasound exam.	Processes each image within an exam independently using a ResNet-18 model and generates saliency map for it, indicating the most important parts.
Concentrates on images that contain suspicious lesions.	Assigns attention scores to each image based on its relative importance.
Considers signals in all images to make a final diagnosis	Aggregates information from all images using an attention mechanism to compute the final predictions for benign and malignant findings.

Table 1. Comparing radiology diagnostic procedure to AI

We compared the performance of the trained network to 10 board-certified breast radiologists in a reader study and to hybrid AI-radiologist models, which average the prediction of the AI and each radiologist.

The neural network was trained with a dataset consisting of approximately four million ultrasound images on an HPC cluster powered by NVIDIA technologies. The cluster consists of 34 computation nodes each of which is equipped with 80 CPUs and four NVIDIA V100 GPUs (16/32 GB). With this cluster, we performed hyperparameter search by launching experiments (each taking around 300 GPU hours) over a broad range of hyperparameters.

A large-scale dataset

To complete this ambitious project, we preprocessed more than eight million breast ultrasound images collected at NYU Langone between 2012 and 2019 and extracted breast-level cancer labels by mining pathology reports.

Training set: 3,930,347 images within 209,162 exams collected from 101,493 patients.
Validation set: 653,924 images within 34,850 exams collected from 16,707 patients.
Internal test set: 858,636 images within 44,755 exams collected from 25,003 patients.

Results: the most exciting part!

Our results show that a hybrid AI-radiologist model decreased false positive rates by 37.4% (that is, false suspicions of malignancy). This would lead to a reduction in the number of requested biopsies by 27.8%, while maintaining the same level of sensitivity as radiologists (Figure 3).

When acting independently, the AI system achieved higher area under the receiver operating characteristic curve (AUROC) and area under the precision recall curve (AUPRC) than individual readers. Figure 3 shows how each reader compares to the network’s performance.

Within the internal test set, the AI system maintained high diagnostic accuracy (0.940-0.990 AUROC) across all age groups, mammographic breast densities, and device manufacturers, including GE, Philips, and Siemens. In the biopsied population, it also achieved a 0.940 AUROC.

In an external test set collected in Egypt, the system achieved 0.911 AUROC, highlighting its generalization ability in patient demographics not seen during training (Figure 4).

Based on qualitative assessment, the network produced appropriate localization information of benign and malignant lesions through its saliency maps. In the exam shown in Figure 4, all 10 breast radiologists thought the lesion appeared suspicious for malignancy and recommended that it undergo biopsy, while the AI system correctly classified it as benign. Most impressively, locations of lesions were never given during training, as it was trained in a weakly supervised manner!

Future work

For our next steps, we’d like to evaluate our system through prospective validation before it can be widely deployed in clinical practice. This enables us to measure its potential impact in improving the experience of women who undergo breast ultrasound examinations each year on a global level.

In conclusion, our work highlights the complementary role of an AI system in improving diagnostic accuracy by significantly decreasing unnecessary biopsies. Beyond improving radiologists’ performance, we have made technical contributions to the methodology of deep learning for medical imaging analysis.

This work would not have been possible without state-of-the-art computational resources. For more information, see the preprint, Artificial Intelligence System Reduces False-Positive Findings in the Interpretation of Breast Ultrasound Exams.

Discuss (0)

About the Authors

About Farah E. Shamout
Farah Shamout is an assistant professor and emerging scholar in computer engineering at NYU Abu Dhabi, where she leads the Clinical Artificial Intelligence Laboratory. Her research focuses on using AI, data science, and machine learning to solve real-world medical problems. Before joining NYU Abu Dhabi, Shamout completed her DPhil (PhD) in engineering science at the University of Oxford in 2019 at the Computational Health Informatics Laboratory on the Rhodes Scholarship.

View all posts by Farah E. Shamout

About Artie Shen
Artie (Yiqiu) Shen is a Ph.D. student at the Center for Data Science, co-advised by Prof. Kyunghyun Cho and Prof. Krzysztof J. Geras. His research interests primarily lie in AI for healthcare and deep learning for medical image analysis. Before joining NYU, he was a software engineer at Two Sigma Investments where he maintained a platform that extracts trading signals from market sentiment. He earned a bachelor’s degree in computer science from Rice University.

View all posts by Artie Shen

About Jan Witowski
Jan is a postdoctoral research fellow at NYU School of Medicine. His research interests include medical image processing, especially in cancer imaging. He graduated from an MD and PhD programs at the Jagiellonian University in Kraków, Poland and previously worked at Harvard Medical School. He is a Forbes 30 under 30 recipient.

View all posts by Jan Witowski

About Jamie Oliver
Jamie Oliver is a senior medical student at the NYU Grossman School of Medicine. His research interests primarily lie in applications of artificial intelligence and machine learning in oncology care and medical image analysis. Before medical school, he earned a bachelor’s degree in psychology with minors in neuroscience and computer science from Princeton University.

View all posts by Jamie Oliver

About Krzysztof Geras
Krzysztof is an assistant professor at NYU School of Medicine and an affiliated faculty at NYU Center for Data Science. His work focuses on developing new deep learning methods and their applications to medical imaging. He previously completed a postdoc at NYU, a PhD at the University of Edinburgh, and a BSc at the the University of Warsaw. He also completed industrial internships at Microsoft Research, Amazon, Microsoft and J.P. Morgan.

View all posts by Krzysztof Geras