Abstract
We present preliminary findings on extracting semantics from reference data generated by the United States Census Bureau. US Census reference data is based on surveys designed to collect demographic and other socioeconomic factors by geographic region. These data sets contain thousands of variables; this complexity makes the reference data difficult to learn, query, and integrate into analyses. Researchers often avoid working directly with US Census reference data and instead work with census-derived extracts that capture a much smaller subset of records. We propose to use natural language processing to extract the semantics of census-based reference data and to map census variables to known ontologies. This semantic processing reduces the large volume of variables to more manageable sets of conceptual variables that can be organized by meaning and semantic type.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2021 IEEE 15th International Conference on Semantic Computing, ICSC 2021 |
| Pages | 88-89 |
| Number of pages | 2 |
| ISBN (electronic) | 9781728188997 |
| DOI | |
| Status | Published - Jan 2021 |
| Event | 15th IEEE International Conference on Semantic Computing, ICSC 2021 - Virtual, Laguna Hills, United States. Duration: Jan 27 2021 → Jan 29 2021 |
Publication series
| Name | Proceedings - 2021 IEEE 15th International Conference on Semantic Computing, ICSC 2021 |
|---|---|
Conference
| Conference | 15th IEEE International Conference on Semantic Computing, ICSC 2021 |
|---|---|
| Country/Territory | United States |
| City | Virtual, Laguna Hills |
| Period | 1/27/21 → 1/29/21 |
Bibliographical note
Publisher Copyright: © 2021 IEEE.
Funding
The project described was supported by the National Institutes of Health through the NIH HEAL Initiative under award number UM1DA049406 and the National Center for Advancing Translational Sciences through grant number UL1TR001998. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.
| Funders | Funder number |
|---|---|
| National Institutes of Health (NIH) | |
| National Institutes of Health (NIH) | UM1DA049406 |
| National Center for Advancing Translational Sciences (NCATS) | UL1TR001998 |
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Science Applications
- Decision Sciences (miscellaneous)
Fingerprint
Dive into the research topics of 'Extracting Semantics from Census-based Reference Data'. Together they form a unique fingerprint.