this post was submitted on 26 Jan 2025
243 points (95.5% liked)
Memes
46522 readers
1254 users here now
Rules:
- Be civil and nice.
- Try not to excessively repost, as a rule of thumb, wait at least 2 months to do it if you have to.
founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
They literally did exactly that. Show me the training data. If it has been provided under an open source license, then I'll revise my statement.
You literally cannot create a useful LLM without the training data. That is a part of the framework used to create the model, and they kept that proprietary. It is a part of the source. This is such an obvious point that I should not have to state it.