This paper reports the classification accuracy of artificial neural network (ANN) models in reproducing health consumers’ classification practices in social media. Social media have driven the growth of unstructured text data across domains including health, which motivates researchers to reconsider the epistemological approach to automated classification. This study compared the performance of several types of ANN models and ensemble models based on classification results and the integration of multiple ANN structures. To train these models, two dictionaries were employed: health consumers’ terms extracted from questions and answers in the health categories of Yahoo!Answers and MeSH terms. All three types of individual classifiers demonstrated accuracies of around 90%. In particular, the fully connected ANN with two layers produced relatively higher classification performances than a convolutional neural network and long short-term memory. Ensemble models based on classification results outperformed not only the ensemble models based on the integration of heterogeneous ANN structures but also individual deep-learning models. The combination of questions and best answers were found to be most effective as a training dataset to build an accurate prediction model. The findings suggest that ANN models can be an effective assistive tool in classifying online health resources generated by health consumers in natural language.
|Number of pages||30|
|Journal||Journal of Library and Information Studies|
|State||Published - 2022|
Bibliographical notePublisher Copyright:
© 2022, National Taiwan University, Department of Library and Information Science. All rights reserved.
- Artificial Neural Network
- Automated Classification
- Deep Learning
- Ensemble Classification Model
- Knowledge Organization
ASJC Scopus subject areas
- Library and Information Sciences