TY - JOUR
T1 - Local Large Language Models for Complex Structured Tasks
AU - Bumgardner, V K Cody
AU - Mullen, Aaron
AU - Armstrong, Samuel E
AU - Hickey, Caylin
AU - Marek, Victor
AU - Talbert, Jeff
N1 - ©2024 AMIA - All rights reserved.
PY - 2024
Y1 - 2024
N2 - This paper introduces an approach that combines the language reasoning capabilities of large language models (LLMs) with the benefits of local training to tackle complex language tasks. The authors demonstrate their approach by extracting structured condition codes from pathology reports. The proposed approach utilizes local, fine-tuned LLMs to respond to specific generative instructions and provide structured outputs. Over 150k uncurated surgical pathology reports containing gross descriptions, final diagnoses, and condition codes were used. Different model architectures were trained and evaluated, including LLaMA, BERT, and LongFormer. The results show that the LLaMA-based models significantly outperform BERT-style models across all evaluated metrics. LLaMA models performed especially well with large datasets, demonstrating their ability to handle complex, multi-label tasks. Overall, this work presents an effective approach for utilizing LLMs to perform structured generative tasks on domain-specific language in the medical domain.
AB - This paper introduces an approach that combines the language reasoning capabilities of large language models (LLMs) with the benefits of local training to tackle complex language tasks. The authors demonstrate their approach by extracting structured condition codes from pathology reports. The proposed approach utilizes local, fine-tuned LLMs to respond to specific generative instructions and provide structured outputs. Over 150k uncurated surgical pathology reports containing gross descriptions, final diagnoses, and condition codes were used. Different model architectures were trained and evaluated, including LLaMA, BERT, and LongFormer. The results show that the LLaMA-based models significantly outperform BERT-style models across all evaluated metrics. LLaMA models performed especially well with large datasets, demonstrating their ability to handle complex, multi-label tasks. Overall, this work presents an effective approach for utilizing LLMs to perform structured generative tasks on domain-specific language in the medical domain.
M3 - Article
C2 - 38827047
SN - 2153-4063
VL - 2024
SP - 105
EP - 114
JO - AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science
JF - AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science
ER -