Incorporating breast cancer recurrence events into population-based cancer registries using medical claims: Cohort study

  • Teresa A'mar
  • , J. David Beatty
  • , Catherine Fedorenko
  • , Daniel Markowitz
  • , Thomas Corey
  • , Jane Lange
  • , Stephen M. Schwartz
  • , Bin Huang
  • , Jessica Chubak
  • , Ruth Etzioni

Producción científica: Articlerevisión exhaustiva

10 Citas (Scopus)

Resumen

Background: There is a need for automated approaches to incorporate information on cancer recurrence events into population-based cancer registries. Objective: The aim of this study is to determine the accuracy of a novel data mining algorithm to extract information from linked registry and medical claims data on the occurrence and timing of second breast cancer events (SBCE). Methods: We used supervised data from 3092 stage I and II breast cancer cases (with 394 recurrences), diagnosed between 1993 and 2006 inclusive, of patients at Kaiser Permanente Washington and cases in the Puget Sound Cancer Surveillance System. Our goal was to classify each month after primary treatment as pre- versus post-SBCE. The prediction feature set for a given month consisted of registry variables on disease and patient characteristics related to the primary breast cancer event, as well as features based on monthly counts of diagnosis and procedure codes for the current, prior, and future months. A month was classified as post-SBCE if the predicted probability exceeded a probability threshold (PT); the predicted time of the SBCE was taken to be the month of maximum increase in the predicted probability between adjacent months. Results: The Kaplan-Meier net probability of SBCE was 0.25 at 14 years. The month-level receiver operating characteristic curve on test data (20% of the data set) had an area under the curve of 0.986. The person-level predictions (at a monthly PT of 0.5) had a sensitivity of 0.89, a specificity of 0.98, a positive predictive value of 0.85, and a negative predictive value of 0.98. The corresponding median difference between the observed and predicted months of recurrence was 0 and the mean difference was 0.04 months. Conclusions: Data mining of medical claims holds promise for the streamlining of cancer registry operations to feasibly collect information about second breast cancer events.

Idioma originalEnglish
Número de artículoe18143
PublicaciónJMIR Cancer
Volumen6
N.º2
DOI
EstadoPublished - jul 2020

Nota bibliográfica

Publisher Copyright:
©Teresa A'mar, J David Beatty, Catherine Fedorenko, Daniel Markowitz, Thomas Corey, Jane Lange, Stephen M Schwartz, Bin Huang, Jessica Chubak, Ruth Etzioni.

Financiación

This work was supported by the National Institutes of Health (UG3CA218909, R21CA143242, R01CA120562, and R01CA093772) and by the American Cancer Society (CRTG-03–024-01-CCE). The collection of cancer incidence data used in this study was supported by the Cancer Surveillance System of the Fred Hutchinson Cancer Research Center, which is funded by Contract Number N01-CN-67009 and N01-PC-35142 from the Surveillance, Epidemiology and End Results (SEER) Program of the National Cancer Institute with additional support from the Fred Hutchinson Cancer Research Center and the State of Washington. RE’s work is partially supported by the Rosalie and Harold Rea Brown Endowment.

FinanciadoresNúmero del financiador
Rosalie and Harold Rea Brown Endowment
Washington State University
National Cancer Institute's Surveillance Epidemiology and End Results
National Institutes of Health (NIH)R21CA143242, UG3CA218909, R01CA120562, R01CA093772
American Cancer SocietyCRTG-03–024-01-CCE, N01-PC-35142, N01-CN-67009
National Childhood Cancer Registry – National Cancer Institute

    ODS de las Naciones Unidas

    Este resultado contribuye a los siguientes Objetivos de Desarrollo Sostenible

    1. Good health and well being
      Good health and well being

    ASJC Scopus subject areas

    • Oncology
    • Cancer Research

    Huella

    Profundice en los temas de investigación de 'Incorporating breast cancer recurrence events into population-based cancer registries using medical claims: Cohort study'. En conjunto forman una huella única.

    Citar esto