Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Extracting Semantics from Census-based Reference Data

Producción científica: Conference contributionrevisión exhaustiva

1 Cita (Scopus)

Resumen

We present preliminary findings in extracting semantics from reference data generated by the United States Census Bureau. US Census reference data is based upon surveys designed to collect demographics and other socioeconomic factors by geographical regions. These data sets contain thousands of variables; this complexity makes the reference data difficult to learn, query, and integrate into analyses. Researchers often avoid working directly with US Census reference data and instead work with census-derived extracts capturing a much smaller subset of records. We propose to use natural language processing to extract the semantics of census-based reference data and to map census variables to known ontologies. This semantic processing reduces the large volume of variables into more manageable sets of conceptual variables that can be organized by meaning and semantic type.

Idioma originalEnglish
Título de la publicación alojadaProceedings - 2021 IEEE 15th International Conference on Semantic Computing, ICSC 2021
Páginas88-89
Número de páginas2
ISBN (versión digital)9781728188997
DOI
EstadoPublished - ene 2021
Evento15th IEEE International Conference on Semantic Computing, ICSC 2021 - Virtual, Laguna Hills, United States
Duración: ene 27 2021ene 29 2021

Serie de la publicación

NombreProceedings - 2021 IEEE 15th International Conference on Semantic Computing, ICSC 2021

Conference

Conference15th IEEE International Conference on Semantic Computing, ICSC 2021
País/TerritorioUnited States
CiudadVirtual, Laguna Hills
Período1/27/211/29/21

Nota bibliográfica

Publisher Copyright:
© 2021 IEEE.

Financiación

The project described was supported by the National Institutes of Health through the NIH HEAL Initiative under grant number UM1DA049406 and the National Center for Advancing Translational Sciences through grant number UL1TR001998. ACKNOWLEDGMENT The project described was supported by the National Institutes of Health through the NIH HEAL Initiative under award number UM1DA049406 and the National Center for Advancing Translational Sciences through grant number UL1TR001998. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

FinanciadoresNúmero del financiador
National Institutes of Health (NIH)
National Institutes of Health (NIH)UM1DA049406
National Center for Advancing Translational Sciences (NCATS)UL1TR001998
National Institutes of Health (NIH)
National Institutes of Health (NIH)UM1DA049406
National Center for Advancing Translational Sciences (NCATS)UL1TR001998

    ASJC Scopus subject areas

    • Artificial Intelligence
    • Computer Science Applications
    • Decision Sciences (miscellaneous)

    Huella

    Profundice en los temas de investigación de 'Extracting Semantics from Census-based Reference Data'. En conjunto forman una huella única.

    Citar esto