this post was submitted on 07 Feb 2024
218 points (98.7% liked)


First, applicant argues that the mark is not merely descriptive because consumers will not immediately understand what the underlying wording "generative pre-trained transformer" means. The trademark examining attorney is not convinced. The previously and presently attached Internet evidence demonstrates the extensive and pervasive use in applicant's software industry of the acronym "GPT" in connection with software that features similar AI technology with ask and answer functions based on pre-trained data sets; the fact that consumers may not know the underlying words of the acronym does not alter the fact that relevant purchasers are adapted to recognizing that the term "GPT" is commonly used in connection with software to identify a particular type of software that features this AI ask and answer technology. Accordingly, this argument is not persuasive.

[–] kogasa@programming.dev 4 points 9 months ago (1 children)

You can't really say any GPT model has nothing to do with OpenAI. They invented the architecture. But the name GPT predates their commercial products using the technology.

[–] NevermindNoMind@lemmy.world 3 points 9 months ago (1 children)

I don't know enough to know whether or not that's true. My understanding was that Google invented the transformer architecture with their paper "Attention Is All You Need." A lot of LLMs, if not most, use a transformer architecture, though you're probably right that a lot of them are based on the open source models OpenAI made available. The "generative" part is just descriptive of the model generating outputs (as opposed to classification and the like), and "pre-trained" just refers to the training process.

But again I'm a dummy so you very well may be right.

[–] kogasa@programming.dev 2 points 9 months ago

The attention paper from Google introduced transformers; OpenAI introduced generative pretraining as a technique that allows transformers to achieve very good performance on downstream tasks with very little additional fine-tuning. This paper and the subsequent release of the pretrained GPT models directly led to the LLM boom.

https://s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf
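
If it helps to see what "generative" and "pre-trained" mean in the most stripped-down form, here's a toy sketch. To be clear about the assumptions: this is a character-level bigram counter, not a transformer and not anything from the paper, and the names `pretrain` and `generate` are made up for illustration. It only shows the general shape of the idea: learn next-token statistics from unlabeled text first, then reuse the learned model to produce output.

```python
from collections import Counter, defaultdict


def pretrain(corpus: str) -> dict:
    """Toy 'pre-training': count which character follows which in unlabeled text.

    Hypothetical helper for illustration only; a real GPT model learns
    next-token probabilities with a transformer, not with a count table.
    """
    counts = defaultdict(Counter)
    for cur, nxt in zip(corpus, corpus[1:]):
        counts[cur][nxt] += 1
    return counts


def generate(model: dict, start: str, length: int) -> str:
    """Toy 'generative' step: greedily append the most frequent next character."""
    out = start
    for _ in range(length):
        followers = model.get(out[-1])
        if not followers:
            # The last character was never seen with a successor, so stop early.
            break
        out += followers.most_common(1)[0][0]
    return out


model = pretrain("abababab")
print(generate(model, "a", 4))
```

The point of the split is that `pretrain` never sees labels or a specific task; anything task-specific would happen afterwards, on top of the already learned statistics, which is roughly what "very little additional fine-tuning" refers to.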