Energy-Efficient Large Language Models

Dobariya Jenish Nileshbhai, Asst. Prof. Rupal J. Shilu

doi:10.70849/IJSCI

Authors

Dobariya Jenish Nileshbhai, Asst. Prof. Rupal J. Shilu Department of Computer Engineering, Atmiya University, Rajkot, India Author

DOI:

https://doi.org/10.70849/IJSCI

Keywords:

Large language model, energy efficiency, quantisation, sparse Mixture-of-Experts, green AI

Abstract

The ecological and economic burden of large pre-trained language models (LLMs) is now on par with mid-sized manufacturing facilities. Recent work has shown careful combinations of sparse architectures, parameter-efficient fine-tuning, and aggressive low-precision inference techniques can reduce total energy use by 35 – 73% maintaining task accuracy. In this evidence synthesis, we review ten significant studies published from 2024 to 2025, which prioritise energy efficiency across the entire LLM lifecycle. We categorise the studies within three layers: architecture/pre-training, fine-tuning, and inference - synthesising benefits; practical trade-offs; benchmarking gaps; and open research questions. Finally, we provide a roadmap to achieve sub-50 Wh chatbot sessions through hardware-software co-design and standardised reporting of the entire lifecycle.

Energy-Efficient Large Language Models

Authors

DOI:

Keywords:

Abstract

Downloads

Published

Issue

Section

License

How to Cite

Make A Submission

Call For Papers

DOI (Crossref) : 10.70849/ijsci

Impact Factor(2024) : 1.6

Important Links

Article Template

Major Indexing partner

License and Author Agreements