Abstract
The efficacy of supervised deep learning in medical image analyses, particularly in pathology, is hindered by the necessity for extensive manual annotations. Annotating images at the gigapixel level manually proves to be a highly labor-intensive and time-consuming task. Semi-supervised learning (SSL) has emerged as a promising approach that leverages unlabeled data to reduce labeling efforts. In this work, we introduce Semi-Path, a practical SSL framework enhanced with active learning (AL) for gigapixel pathology tasks. Unlike existing methods that treat SSL and AL as independent components where AL incurs significant computational complexity to SSL, we propose a deep fusion of SSL and AL into a unified framework. Our framework introduces Informative Active Annotation (IAA) that employs a SSL-AL iterative structure to effectively extract knowledge from unlabeled pathology data. This structure significantly minimizes labeling efforts and computational complexity. Then, we propose Adaptive Pseudo-Labeling (APL) to address heterogeneity in class distribution, and prediction difficulty that are often observed in real-world pathology tasks. We evaluate Semi-Path on pathology image classification and segmentation tasks over three datasets that include WSIs from breast, colorectal, and brain tissues. The experimental results demonstrate the consistent superiority of Semi-Path over state-of-the-art methods.
Original language | English |
---|---|
Article number | 100474 |
Journal | Smart Health |
Volume | 32 |
DOIs | |
State | Published - Jun 2024 |
Bibliographical note
Publisher Copyright:© 2024 The Authors
Funding
This work was supported by the Noyce Initiative UC Partnerships in Computational Transformation Grant and the UC Davis Center for Women's Cardiovascular and Brain Health research program under the HEAL-HER (Heart, BrEast, and BrAin HeaLth Equity Research) award made possible by the Cy Pres funds. This work also received additional partial support from National Institutes of Health grants P30 AG072972 and R01 AG062517. The authors would like to thank the families and participants of the University of California, Davis Alzheimer's Disease Research Center (UCD-ADRC) for their generous donations, as well as all the faculty and staff of the UCD-ADRC. The views and opinions expressed in this manuscript are those of the author and do not necessarily reflect the official policy or position of any public health agency of California or of the United States government. This work was supported by the Noyce Initiative UC Partnerships in Computational Transformation Grant and the UC Davis Center for Women’s Cardiovascular and Brain Health research program under the HEAL-HER (Heart, BrEast, and BrAin HeaLth Equity Research) award made possible by the Cy Pres funds . This work also received additional partial support from National Institutes of Health grants P30 AG072972 and R01 AG062517 . The authors would like to thank the families and participants of the University of California, Davis Alzheimer’s Disease Research Center (UCD-ADRC) for their generous donations, as well as all the faculty and staff of the UCD-ADRC. The views and opinions expressed in this manuscript are those of the author and do not necessarily reflect the official policy or position of any public health agency of California or of the United States government.
Funders | Funder number |
---|---|
UCD-ADRC | |
Noyce Initiative UC | |
University of California, Davis Alzheimer's Disease Research Center | |
University of California Davis | |
National Institutes of Health (NIH) | P30 AG072972, R01 AG062517 |
National Institutes of Health (NIH) |
Keywords
- Active learning
- Pathology image analysis
- Semi-supervised learning
ASJC Scopus subject areas
- Medicine (miscellaneous)
- Information Systems
- Health Informatics
- Computer Science Applications
- Health Information Management