this post was submitted on 02 Aug 2023
360 points (94.1% liked)
Technology
59308 readers
4864 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
More than double, as query size is very much connected to the effective cost of the generation, and you'd need to include both the query and initial response in that second pass.
Then - you might need to make an API call to a search engine or knowledge DB to fact check it.
And include that data as context along with the query and initial response to whatever decides if it's BS.
So for a dumb realtime chat application, no one is going to care enough to slow out down and exponentially increase costs to avoid hallucinations.
But for AI replacing a $120,000 salaried role in writing up a white paper on some raw data analysis, a 10-30x increase over a $0.15 query is more than acceptable.
So you will see this approach taking place in enterprise scenarios and professional settings, even if we may never see them in chatbots.