this post was submitted on 24 Jan 2025
199 points (98.5% liked)
technology
23473 readers
262 users here now
On the road to fully automated luxury gay space communism.
Spreading Linux propaganda since 2020
- Ways to run Microsoft/Adobe and more on Linux
- The Ultimate FOSS Guide For Android
- Great libre software on Windows
- Hey you, the lib still using Chrome. Read this post!
Rules:
- 1. Obviously abide by the sitewide code of conduct. Bigotry will be met with an immediate ban
- 2. This community is about technology. Offtopic is permitted as long as it is kept in the comment sections
- 3. Although this is not /c/libre, FOSS related posting is tolerated, and even welcome in the case of effort posts
- 4. We believe technology should be liberating. As such, avoid promoting proprietary and/or bourgeois technology
- 5. Explanatory posts to correct the potential mistakes a comrade made in a post of their own are allowed, as long as they remain respectful
- 6. No crypto (Bitcoin, NFT, etc.) speculation, unless it is purely informative and not too cringe
- 7. Absolutely no tech bro shit. If you have a good opinion of Silicon Valley billionaires please manifest yourself so we can ban you.
founded 4 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
So AI could be more sustainable this whole time? Interesting…
Who wants to bet their AI will be used for more useful things too?
AI, as in machine learning, has always had uses. Like, real useful additions to human capability. It’s very useful in pattern recognition and statistical analysis of various things.
Our more modern shit connotation comes from capitalists trying replace labor with generative AI, which is a small subset of potential machine learning uses, but by its nature sends massive amounts of shlock at us.
ML has made our xsh translations coherent.
Having dealt with ML engineers in depth before, American tech companies tend to throw blank checks their way which, combined with them not tending to have backgrounds in optimization or infrastructure, means they spin up 8 billion GPU instances in the cloud and use 10% of them ever because engineers are lazy.
They could, without any exaggeration, reduce their energy consumption by a factor of ten with about two weeks of honest engineering work. Yes this bothers the fuck out of me.
That's my biggest gripe with mainstream closed source AI, they can optimize some of their most powerful MLAs to run on a potato but... they don't. And they'll never open source becaue it'd be forked by people who are genuinely passionate about improvement.
AKA they'd be run outta business in no time.
Is it actually “sustainable” though? I’d like to see some wattage numbers.
I said this in the other thread but I’ll say it again: our sophisticated artificial intelligence, their planet burning chatbot.
Unlike in the West where if Nvidia tanks then the entire US economy goes down with it.
Idk about sustainability. It still requires massive computing infrastructure, which for now requires unsustainable ways of sourcing metals
What unsustainable ways of sourcing metals are you thinking about here?
rare earth mining
I think, unfortunately, REE mining is pretty far down the list of environmentally polluting extractive industries.
Not even using kids in the Congo to mine them.
Yeah a lot of the breakthrough research has happened via cheaper / resource limited methods now that the cat is out of the bag. We live in interesting times.
It's not that unsustainable unless you believe the predicts that your microwave will be building it's own LLM model every month by 2030
The electricity and water usage isn't very high in general or compared to data centre usage more broadly (and is basically nothing compared to crypto)
source: ML guy i know so this could be entirely unsubstantiated but apparently the main environmental burden of LLM infrastructure comes from training new models not serving inference from already deployed models.
That might change now that companies are creating “reasoning” models like DeepSeek R1. They aren’t really all that different architecturally but they produce longer outputs which just requires more compute.