Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Assigning ICD-O-3 codes to pathology reports using neural multi-task training with hierarchical regularization

Producción científica: Conference contributionrevisión exhaustiva

9 Citas (Scopus)

Resumen

Tracking population-level cancer information is essential for researchers, clinicians, policymakers, and the public. Unfortunately, much of the information is stored as unstructured data in pathology reports. Thus, too process the information, we require either automated extraction techniques or manual curation. Moreover, many of the cancer-related concepts appear infrequently in real-world training datasets. Automated extraction is difficult because of the limited data. This study introduces a novel technique that incorporates structured expert knowledge to improve histology and topography code classification models. Using pathology reports collected from the Kentucky Cancer Registry, we introduce a novel multi-task training approach with hierarchical regularization that incorporates structured information about the International Classification of Diseases for Oncology, 3rd Edition classes to improve predictive performance. Overall, we find that our method improves both micro and macro F1. For macro F1, we achieve up to a 6% absolute improvement for topography codes and up to 4% absolute improvement for histology codes.

Idioma originalEnglish
Título de la publicación alojadaProceedings of the 12th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB 2021
ISBN (versión digital)9781450384506
DOI
EstadoPublished - ene 18 2021
Evento12th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB 2021 - Virtual, Online, United States
Duración: ago 1 2021ago 4 2021

Serie de la publicación

NombreProceedings of the 12th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB 2021

Conference

Conference12th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB 2021
País/TerritorioUnited States
CiudadVirtual, Online
Período8/1/218/4/21

Nota bibliográfica

Publisher Copyright:
© 2021 ACM.

Financiación

This research was supported by the Shared Resource Facilities of the University of Kentucky Markey Cancer Center (P30CA177558). Kavuluru’s effort was also supported by the U.S. National Library of Medicine under award number R01LM013240.

FinanciadoresNúmero del financiador
U.S. National Library of MedicineR01LM013240
University of Kentucky Markey Comprehensive Cancer CenterP30CA177558

    ODS de las Naciones Unidas

    Este resultado contribuye a los siguientes Objetivos de Desarrollo Sostenible

    1. Good health and well being
      Good health and well being

    ASJC Scopus subject areas

    • Computer Science Applications
    • Software
    • Biomedical Engineering
    • Health Informatics

    Huella

    Profundice en los temas de investigación de 'Assigning ICD-O-3 codes to pathology reports using neural multi-task training with hierarchical regularization'. En conjunto forman una huella única.

    Citar esto