this post was submitted on 11 Jan 2024
Open Source
🙂 my bad
No, it's not about anyone suing me over Lemmy comments. AI is trained on lots of data, and the World Wide Web is full of publicly accessible data like our comments. However, not all publicly accessible data may be used without a license. Examples are newspaper articles, videos, and still pictures. Normally, if you want to use those commercially, the license holder has to give consent and in some cases a fee has to be paid.
GitHub Copilot (owned by Microsoft) is an AI model that helps people write code. However, it was trained mostly on open source code (code made publicly available), which is very often licensed. Many of those licenses allow commercial use on the condition that the resulting commercial code is made publicly available too. Microsoft does not make the code for Copilot publicly accessible, and it uses code licensed in many, many other ways, all without asking for consent.
This is a double standard, because companies that hide their code often fight very hard to keep it secret and take anyone who uses it without a license to court. However, they will happily use licensed content to their own benefit without consent or payment.
With some clever tricks, AIs have been duped into revealing their training data (often licensed, sometimes very private, e.g. addresses, birthdays, health information, etc.). Lawsuits have ensued (against AI owners like Microsoft) and are still ongoing, with verdicts pending. Until the verdicts come, I add the license link to my comments. Who knows, maybe it will have an impact, maybe not.
I hope I was able to explain the situation in an understandable way.
Have a good day.
CC BY-NC-SA 4.0
I see - thanks for taking the time to explain the backstory, very interesting.
You're welcome. Thank you for reading :)