Energy-Efficient Large Language Models

Authors

  • Dobariya Jenish Nileshbhai, Asst. Prof. Rupal J. Shilu Department of Computer Engineering, Atmiya University, Rajkot, India Author

DOI:

https://doi.org/10.70849/IJSCI

Keywords:

Large language model, energy efficiency, quantisation, sparse Mixture-of-Experts, green AI

Abstract

The ecological and economic burden of large pre-trained language models (LLMs) is now on par with mid-sized manufacturing facilities. Recent work has shown careful combinations of sparse architectures, parameter-efficient fine-tuning, and aggressive low-precision inference techniques can reduce total energy use by 35 – 73% maintaining task accuracy. In this evidence synthesis, we review ten significant studies published from 2024 to 2025, which prioritise energy efficiency across the entire LLM lifecycle. We categorise the studies within three layers: architecture/pre-training, fine-tuning, and inference - synthesising benefits; practical trade-offs; benchmarking gaps; and open research questions. Finally, we provide a roadmap to achieve sub-50 Wh chatbot sessions through hardware-software co-design and standardised reporting of the entire lifecycle.

Downloads

Published

06-10-2025

How to Cite

[1]
Dobariya Jenish Nileshbhai, Asst. Prof. Rupal J. Shilu, “Energy-Efficient Large Language Models”, Int. J. Sci. Inno. Eng., vol. 2, no. 10, pp. 160–164, Oct. 2025, doi: 10.70849/IJSCI.