Abstract

Surveillance of drug overdose deaths relies on death certificates for identification of the substances that caused death. Drugs and drug classes can be identified through the International Classification of Diseases, Tenth Revision (ICD-10), codes present on death certificates. However, ICD-10 codes do not always provide high levels of specificity in drug identification. To achieve more fine-grained identification of substances on death certificate, the free-text cause-of-death section, completed by the medical certifier, must be analyzed. Current methods for analyzing free-text death certificates rely solely on lookup tables for identifying specific substances, which must be frequently updated and maintained. To improve identification of drugs on death certificates, a deep-learning named-entity recognition model was developed, utilizing data from the Kentucky Drug Overdose Fatality Surveillance System (2014–2019), which achieved an F1-score of 99.13%. This model can identify new drug misspellings and novel substances that are not present on current surveillance lookup tables, enhancing the surveillance of drug overdose deaths.

Original languageEnglish
Pages (from-to)257-266
Number of pages10
JournalAmerican Journal of Epidemiology
Volume192
Issue number2
DOIs
StatePublished - Feb 1 2023

Bibliographical note

Publisher Copyright:
© The Author(s) 2022. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved.

Funding

The Kentucky Injury Prevention and Research Center (KIPRC) is not the owner of the data used in our study; the data is owned by the Kentucky State Office of Vital Statistics (OVS). In accordance with KIPRC’s Memorandum of Understanding with the Kentucky OVS, we are legally prohibited from releasing line-level death certificate data. Death certificates contain identifying information (including name, location of residence, and social security numbers), and in some cases the free text describing how an individual died can be identifying. Each state houses their own OVS, and data may be requested for research purposes from this body; the address for the Kentucky Office of Vital Statistics is 275 E. Main St, 1E-A, Frankfort, KY 40621, and data may be requested from here for research purposes. Additionally, the National Center for Health Statistics has recently made available a death certificate literal-text file, which can be requested and contains the free-text cause-of-death information for all US resident deaths: https://www.cdc.gov/rdc/b1datatype/rdcltf. html. Documented code for the methods described are available at https://github.com/pjward5656/DC_flair. We thank the Kentucky Office of Vital Statistics for providing the data for this study.

FundersFunder number
Kentucky Injury Prevention and Research Center
Kentucky OVS
Kentucky Office of Vital Statistics1E-A
Kentucky State Office of Vital Statistics

    Keywords

    • deep learning
    • drug overdose
    • machine learning
    • surveillance

    ASJC Scopus subject areas

    • General Medicine

    Fingerprint

    Dive into the research topics of 'Deep Neural Networks for Fine-Grained Surveillance of Overdose Mortality'. Together they form a unique fingerprint.

    Cite this