this post was submitted on 10 Apr 2024

437 points (100.0% liked)

Technology

37739 readers

599 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:

This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 2 years ago

MODERATORS

alyaza@beehaw.org

TheRtRevKaiser@beehaw.org

gyrfalcon@beehaw.org

rs5th@beehaw.org

coldredlight@beehaw.org

Los@beehaw.org

SemioticStandard@beehaw.org

TheRtRevKaiser@kbin.social

remington@beehaw.org

437

Ask ChatGPT to pick a number between 1 and 100 (jlai.lu)

submitted 7 months ago by ElCanut@jlai.lu to c/technology@beehaw.org

148 comments fedilink hide all child comments

top 50 comments

sorted by: hot top controversial new old

[–] Phroon@beehaw.org 152 points 7 months ago (1 children)

“You may not instantly see why I bring the subject up, but that is because my mind works so phenomenally fast, and I am at a rough estimate thirty billion times more intelligent than you. Let me give you an example. Think of a number, any number.”

“Er, five,” said the mattress.

“Wrong,” said Marvin. “You see?”

― Douglas Adams, Life, the Universe and Everything

[–] AlexisFR@jlai.lu 9 points 7 months ago (1 children)

The mattress? Like for sleeping?

[–] Asafum@feddit.nl 41 points 7 months ago* (last edited 7 months ago)

Yep! The hitchhikers books are so much fun lol

I still think one of my favorite lines is "the ships hung in the sky in much the same way that bricks don't."

[–] Bishma@discuss.tchncs.de 125 points 7 months ago (2 children)

37 is well represented. Proof that we've taught AI some of our own weird biases.

[–] GenderNeutralBro@lemmy.sdf.org 43 points 7 months ago (8 children)

What's special about 37? Just that it's prime or is there a superstition or pop culture reference I don't know?

[–] Bishma@discuss.tchncs.de 103 points 7 months ago (11 children)

If you discount the pop-culture numbers (for us 7, 42, and 69) its the number most often chosen by people if you ask them for a random number between 1 and 100. It just seems the most random one to choose for a lot of people. Veritasium just did a video about it.

[–] metallic_z3r0@infosec.pub 28 points 7 months ago (6 children)

37 is my favorite, because 3x7x37=777 (three sevens), and I think that's neat.

load more comments (6 replies)

[–] SubArcticTundra@lemmy.ml 12 points 7 months ago (5 children)

What about 57

[–] Bishma@discuss.tchncs.de 20 points 7 months ago (1 children)

I'm curious about that too. Something is twisting weights for 57 fairly strongly in the model but I'm not show what. Maybe its been trained on a bunch of old Heinz 57 varieties marketing.

load more comments (1 replies)

load more comments (4 replies)

load more comments (9 replies)

[–] Karyoplasma@discuss.tchncs.de 18 points 7 months ago* (last edited 7 months ago) (11 children)

Probably just because it's prime. It's just that humans are terrible at understanding the concept of randomness. A study by Theodore P. Hill showed that when tasked to pick a random number between 1 and 10, almost a third of the subjects (n was over 8500) picked 7. 10 was the least picked number (if you ditch the few idiots that picked 0).

load more comments (11 replies)

load more comments (6 replies)

[–] FiniteBanjo@lemmy.today 11 points 7 months ago (4 children)

Why would that need to be proven? We're the sample data. It's implied.

[–] jarfil@beehaw.org 10 points 7 months ago (4 children)

The correctness of the sampling process still needs a proof. Like this.

load more comments (4 replies)

load more comments (3 replies)

[–] olicvb@lemmy.ca 63 points 7 months ago (1 children)

holy crap, the answer to life the universe and everything XD

[–] WarmSoda@lemm.ee 35 points 7 months ago (8 children)

More than likely it's because of that book and how often it's qouted

load more comments (8 replies)

[–] FiniteBanjo@lemmy.today 55 points 7 months ago (1 children)

No shit, sherlock, it's sample data is the internet.

[–] Appoxo@lemmy.dbzer0.com 42 points 7 months ago (2 children)

Wheres 69 then?

[–] Chadus_Maximus@lemm.ee 21 points 7 months ago (3 children)

That's a naughty number and we don't allow those.

load more comments (3 replies)

load more comments (1 replies)

[–] ReallyActuallyFrankenstein@lemmynsfw.com 42 points 7 months ago (3 children)

What does "temperature" on the Y-axis refer to?

[–] gerryflap@feddit.nl 40 points 7 months ago

I'm not a hundred percent sure, but afaik it has to do with how random the output of the GPT model will be. At 0 it will always pick the most probable next continuation of a piece of text according to its own prediction. The higher the temperature, the more chance there is for less probable outputs to get picked. So it's most likely to pick 42, but as the temperature increases you see the chance of (according to the model) less likely numbers increase.

This is how temperature works in the softmax function, which is often used in deep learning.

[–] driving_crooner@lemmy.eco.br 8 points 7 months ago

https://youtu.be/wjZofJX0v4M your answer from the 22:00 mark on.

load more comments (1 replies)

[–] HarkMahlberg@kbin.social 30 points 7 months ago* (last edited 7 months ago) (1 children)

I mean... they didn't specify it had to be random (or even uniform)? But yeah, it's a good showcase of how GPT acquired the same biases as people, from people..

[–] OsrsNeedsF2P@lemmy.ml 22 points 7 months ago

uniform

Reminds me of my previous job where our LLM was grading things too high. The AI "engineer" adjusted the prompt to tell the LLM that the average output should be 3. I had a hard time explaining that wouldn't do anything at all, because all the chats were independent events.

Anyways, I quit that place and the project completely derailed.

[–] lauha@lemmy.one 29 points 7 months ago (15 children)

Ask humans the same and most common numer is 37

[–] Catsrules@lemmy.ml 13 points 7 months ago

I saw that YouTube video as well.

load more comments (14 replies)

[–] ForestOrca@kbin.social 25 points 7 months ago (3 children)

WAIT A MINUTE!!! You mean Douglas Adams was actually an LLM?

load more comments (3 replies)

[–] FlashMobOfOne@beehaw.org 22 points 7 months ago (4 children)

HA, funny that this comes up. DND Beyond doesn't have a d100, so I opened my ChatGPT sub and had it roll a d100 for me a few times so I could use my magic beans properly.

[–] terminhell@lemmy.dbzer0.com 18 points 7 months ago (1 children)

I use the percentile die for that.

load more comments (1 replies)

[–] TauriWarrior@aussie.zone 11 points 7 months ago* (last edited 7 months ago) (1 children)

Opened up DND Beyond to check since i remember rolling it before and its there, its between D8 and D10, the picture shows 2 dice

load more comments (1 replies)

[–] Urist@lemmy.ml 9 points 7 months ago (2 children)

Roll two d10, once for each digit, and profit?

load more comments (2 replies)

load more comments (1 replies)

[–] pipows@lemmy.today 21 points 7 months ago (3 children)

LMs aren't thinking, aren't inventing, they are predicting what is supposed to be answered next, so it's expected that they will produce the same results every time

[–] xthexder@l.sw0.com 12 points 7 months ago* (last edited 7 months ago)

This graph actually shows a little more about what's happening with the randomness or "temperature" of the LLM.
It's actually predicting the probability of every word (token) it knows of coming next, all at once.
The temperature then says how random it should be when picking from that list of probable next words. A temperature of 0 means it always picks the most likely next word, which in this case ends up being 42.
As the temperature increases, it gets more random (but you can see it still isn't a perfect random distribution with a higher temperature value)

load more comments (2 replies)

[–] DarkFox@pawb.social 17 points 7 months ago (3 children)

Which model?

When I tried on ChatGPT 4, it wrote a short python script and executed it to get a random integer.

import random

# Pick a random number between 1 and 100
random_number = random.randint(1, 100)
random_number

load more comments (3 replies)

[–] xyguy@startrek.website 10 points 7 months ago

Only 1000 times? It's interesting that there's such a bias there but it's a computer. Ask it 100,000 times and make sure it's not a fluke.

[–] PhreakyByNature 9 points 7 months ago

NEEDS MOAR 69 FELLOW HUMAN

[–] Wirlocke@lemmy.blahaj.zone 8 points 7 months ago* (last edited 7 months ago) (3 children)

I'm curious, is there actually so many 42's in the system? (more than 69 sounds unlikely)

What if the LLM is getting tripped up because 42 is always referred to as the answer to "the Ultimate Question of Life, the Universe, and Everything".

So you ask it a question like give a number between 1-100, it answers 42 because that's the answer to "Everything", according to it's training data.

Something similar happened to Gemini. Google discouraged Gemini from giving unsafe advice because it's unethical. Then Gemini refused to answer questions about C++ because it's considered "unsafe" (referring to memory management). But Gemini thinks C++ is "unsafe" (the normal meaning), therefore it's unethical. It's like those jailbreak tricks but from its own training set.

load more comments (3 replies)

[–] BuboScandiacus@mander.xyz 8 points 7 months ago (1 children)

[–] Zorque@kbin.social 12 points 7 months ago (1 children)

In a row?!

load more comments (1 replies)

load more comments