this post was submitted on 26 Jan 2025
243 points (95.5% liked)

Memes

46522 readers
1254 users here now

Rules:

  1. Be civil and nice.
  2. Try not to excessively repost, as a rule of thumb, wait at least 2 months to do it if you have to.

founded 5 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] trevor@lemmy.blahaj.zone 2 points 2 days ago* (last edited 2 days ago)

They did not release the final model without the data

They literally did exactly that. Show me the training data. If it has been provided under an open source license, then I'll revise my statement.

You literally cannot create a useful LLM without the training data. That is a part of the framework used to create the model, and they kept that proprietary. It is a part of the source. This is such an obvious point that I should not have to state it.