HACKER Q&A
📣 danschmitt

What is your take on carbon emissions of Machine Learning


At the latest since the widely cited article on Green AI [1], there has been increased discussion about the energy consumption of ML. Whenever I talk to colleagues about this, we always agree that there is a dangerous potential lying dormant here, but somehow it cannot be properly grasped. There is simply a lack of really good data. Calculators for emission like from Victor Schmidt and the whole Yoshua Bengio Team at Mila [2] [3] are very good tools for your own use, but can tell you little about the overall situation. So here are two questions for the Hackernews community: 1) How do you assess the topic of ML energy consumption in general and what are factors you consider important? 2) Is it possible to make the energy consumption of ML transparent via legal regulation (such as the currently negotiated AI Act of the EU) and if so, how can something like this look exactly? Or would this be a disaster in the making, because in reality ML is only a small building block of a larger architecture and thus far too laborious to measure for small stakeholder?

[1] https://arxiv.org/abs/1907.10597 [2] https://arxiv.org/abs/1910.09700 [3] https://github.com/mlco2/impact


  👤 max_ Accepted Answer ✓
To cut carbon emissions significantly, we need to move away from fossil fuels.

Anything else (other sources of emission) is simply a distraction from this simple & obvious solution.


👤 Bostonian
A carbon tax would increase the price of electricity. People could decide if the improved accuracy of a more complicated model is worth more than the extra power costs.

👤 mattwilsonn888
Interesting how the majority of use, if economics is to help us judge, of machine learning is to manipulate people into wasting time or buying certain products and the discussion of energy so far, has no one up in arms against it, and I doubt there are many brave enough here to risk being called a Luddite to do so. Crypto currencies certainly don't get the same nuanced treatment - but perhaps discussion is too filled with loud minorities.

👤 axg11
(Context: I work on recommendation systems at a large co - opinions my own)

I think there is a real blind spot in terms of the carbon emissions of ML, somewhat on the research side, but particularly on the production side. Let's take YouTube for example (note: I don't work for YouTube/Google, but I am speculating based on experience with similar complex systems).

A typical YouTube user will interact with a variety of machine learning powered systems:

- Newly uploaded videos to watch (recommendations)

- Recommendations defined by category

- Auto-play next video

- Search

- Ads

- Automated video captions

Each of these is likely a separate team or multiple teams. Each of these will be powered by one or more machine learning models as part of a complex pipeline. These ML models may analyze video content (computationally expensive), audio, user history, etc. Some of these ML models will run in real time, some of them will recompute huge datasets on a daily basis.

What's my point? Very few people have any insight into the carbon impact of all this compute. Perhaps it's negligible, perhaps it's huge, we just don't know. In the last two years, there has been a huge focus on the carbon emissions of PoW cryptocurrencies (e.g. Bitcoin). The impact is real, but I also believe crypto is an easy target. The data is public, hash rate is known and total power use is easy to estimate. The same can't be said for ML, there are a lot of hidden variables and few people are aware of the vast amount of compute that occurs to serve their favourite apps.

Final note: I'm only using YouTube as a common relatable example here. Almost every complex modern service will look similar in terms of number of ML models and downstream carbon impact.


👤 lostdog
The emissions of ML are not a special category. It's like caring about the emissions of addition vs subtraction, but ignoring every other math operation.

Training is infrequent, and everyone involved cares about making training more efficient and faster.

If you care about emissions, then tax, penalize, and regulate emissions. Everyone doing ML will then look at what's costly, and adjust how they use ML models to fit their budget.

Frankly, ML is the best and most efficient way to solve many important problems. For some problems, it's the only technique that works. Trying to limit ML as though it's a special source of emissions is one of the most counterproductive lines of thinking I've ever come across.


👤 crate_barre
The rate of carbon emission growth is a concern in places like India and China, not from bitcoin or machine learning. Someone should check me on this but I don’t even think getting the entire West to reduce carbon emissions will mean anything unless you can slow down the parabolic rate of India and China.

So, no, machine learning is like at the bottom of the list in terms of concerns.


👤 cultofmetatron
carbon emissions from deep learning and machine learning will drop once we switch to the new analog hybrid circuitry thats being developed.