Skip to main navigation Skip to search Skip to main content

A Light-Speed Large Language Model Accelerator with Optical Stochastic Computing

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

To address the increasingly intensive computational demands of attention-based large language models (LLMs), there is a growing interest in developing energy-efficient and high-speed hardware accelerators. To that end, photonics is being considered as an alternative technology to digital electronics. This work introduces a novel optical hardware accelerator that leverages stochastic computing principles for LLMs. Our proposed accelerator incorporates full-range optical stochastic multipliers and stochastic-analog compute-capable optical-to-electrical transducer units to efficiently handle static and dynamic tensor computations in attention-based models. Our analysis shows that our accelerator exhibits at least 7.6× speedup and 1.3× lower energy compared to state-of-the-art LLMs hardware accelerators.

Original languageEnglish
Title of host publicationGLSVLSI 2025 - Proceedings of the Great Lakes Symposium on VLSI 2025
Pages922-928
Number of pages7
ISBN (Electronic)9798400714962
DOIs
StatePublished - Jun 29 2025
Event35th Edition of the Great Lakes Symposium on VLSI 2025, GLSVLSI 2025 - New Orleans, United States
Duration: Jun 30 2025Jul 2 2025

Publication series

NameProceedings of the ACM Great Lakes Symposium on VLSI, GLSVLSI

Conference

Conference35th Edition of the Great Lakes Symposium on VLSI 2025, GLSVLSI 2025
Country/TerritoryUnited States
CityNew Orleans
Period6/30/257/2/25

Bibliographical note

Publisher Copyright:
© 2025 Copyright held by the owner/author(s).

Keywords

  • inference acceleration
  • optical computing
  • silicon photonics
  • stochastic computing
  • Transformer neural networks

ASJC Scopus subject areas

  • General Engineering

Fingerprint

Dive into the research topics of 'A Light-Speed Large Language Model Accelerator with Optical Stochastic Computing'. Together they form a unique fingerprint.

Cite this