TY - UNPB
T1 - Institutional Platform for Secure Self-Service Large Language Model Exploration
AU - Bumgardner, V. K. Cody
AU - Klusty, Mitchell A.
AU - Logan, W. Vaiden
AU - Armstrong, Samuel E.
AU - Hickey, Caylin
AU - Talbert, Jeff
N1 - 10 pages 11 figures, 5 listings, 4 tables
PY - 2024/2/1
Y1 - 2024/2/1
N2 - This paper introduces a user-friendly platform developed by the University of Kentucky Center for Applied AI, designed to make large, customized language models (LLMs) more accessible. By capitalizing on recent advancements in multi-LoRA inference, the system efficiently accommodates custom adapters for a diverse range of users and projects. The paper outlines the system's architecture and key features, encompassing dataset curation, model training, secure inference, and text-based feature extraction. We illustrate the establishment of a tenant-aware computational network using agent-based methods, securely utilizing islands of isolated resources as a unified system. The platform strives to deliver secure LLM services, emphasizing process and data isolation, end-to-end encryption, and role-based resource authentication. This contribution aligns with the overarching goal of enabling simplified access to cutting-edge AI models and technology in support of scientific discovery.
AB - This paper introduces a user-friendly platform developed by the University of Kentucky Center for Applied AI, designed to make large, customized language models (LLMs) more accessible. By capitalizing on recent advancements in multi-LoRA inference, the system efficiently accommodates custom adapters for a diverse range of users and projects. The paper outlines the system's architecture and key features, encompassing dataset curation, model training, secure inference, and text-based feature extraction. We illustrate the establishment of a tenant-aware computational network using agent-based methods, securely utilizing islands of isolated resources as a unified system. The platform strives to deliver secure LLM services, emphasizing process and data isolation, end-to-end encryption, and role-based resource authentication. This contribution aligns with the overarching goal of enabling simplified access to cutting-edge AI models and technology in support of scientific discovery.
KW - cs.CR
KW - cs.AI
KW - cs.CL
M3 - Preprint
BT - Institutional Platform for Secure Self-Service Large Language Model Exploration
ER -