Green AI: Sharing Language Models for a Sustainable Future

Green AI: Sharing Language Models for a Sustainable Future

Transformers, despite their impressive capabilities, come at a significant environmental cost. Training these large models on massive datasets requires substantial computing power, leading to increased carbon emissions.

The below graph highlights the environmental impact of various human activities, emphasizing the need for more sustainable AI development.


Image obtained from

Training Large Language Models (LLMs) is a computationally intensive process with significant energy costs.The type of energy used plays a crucial role in determining the environmental impact. While renewable energy sources can mitigate these concerns, reliance on non-renewable fuels like coal can exacerbate them.

Several factors contribute to the energy consumption of LLM training:

  • Training Time: Longer training durations lead to increased energy consumption.
  • Hardware Type: The computational power of the hardware used directly affects energy usage.
  • Training Methodology: Practices like using pre-trained models, fine-tuning, and starting with smaller experiments can help optimize energy consumption.

To further reduce energy costs:

  • Understand Hyperparameter Ranges: Knowing the typical ranges for hyperparameters can help minimize unnecessary experimentation.
  • Employ Random Search: Random search often provides comparable results to grid search while using fewer resources.

By adopting these strategies, researchers and developers can contribute to more sustainable LLM training practices.


Imagine if each time your company wanted to train a model, it did so from scratch. This would lead to huge, unnecessary global costs!

This is why sharing language models is super important: Sharing the trained weights and building on top of already trained weights reduces the overall compute cost and carbon footprint of the community.


Following are some of the tools to find the carbon footprint of your activities

  1. https://mlco2.github.io/impact/
  2. https://codecarbon.io

Srinath Mulugund

Vice President - Quality Engineering (AI & ML)

5 个月

Nice article.

Pushparaj Kamaraj

Head of engineering | Fraud and Risk Management | DFS | Digital payment | Lending | BaaS | 3DSS | Digital transformation using cloud technologies

5 个月

Thanks Sudhindra, Magadi for the insight. This is an interesting topic for the governments to invest in. Instead of optimising the power usage private companies are focusing on nuclear energy which will open a Pandora box. Refer the below article https://www.theverge.com/2023/9/26/23889956/microsoft-next-generation-nuclear-energy-smr-job-hiring

要查看或添加评论,请登录

Sudhindra, Magadi的更多文章

社区洞察

其他会员也浏览了