From manual to machine: assessing the efficacy of large language models in content analysis

Andrew Pilny, Kelly McAninch, Amanda Slone, Kelsey Moore

Research output: Contribution to journalArticlepeer-review

Abstract

This study compares the performance of Large Language Models (LLMs) and human coders in predicting relational uncertainty from textual data. Employing various LLMs (gpt-4.0-turbo, gpt-3.5-turbo, Claude 2, llama7b-v2-chat, and llama13b-v2-chat), we found that these models perform comparably to human coders, with only minor differences in Mean Squared Error (MSE) values. However, not all LLMs performed equally, underscoring the importance of model selection. Our findings highlight the potential of LLMs as a scalable tool for content analysis, but also emphasize their nuanced application based on the specific research context. The study advances the discourse on the use of LLMs in content analysis and provides insights for future research in this rapidly evolving field.

Original languageEnglish
Pages (from-to)61-70
Number of pages10
JournalCommunication Research Reports
Volume41
Issue number2
DOIs
StatePublished - 2024

Bibliographical note

Publisher Copyright:
© 2024 Eastern Communication Association.

Keywords

  • Large language models
  • content analysis
  • interpersonal communication
  • machine learning
  • text-mining

ASJC Scopus subject areas

  • Communication

Fingerprint

Dive into the research topics of 'From manual to machine: assessing the efficacy of large language models in content analysis'. Together they form a unique fingerprint.

Cite this