Archive link: https://archive.ph/GtA4Q
The complete destruction of Google Search via forced AI adoption and the carnage it is wreaking on the internet is deeply depressing, but there are bright spots. For example, as the prophecy foretold, we are learning exactly what Google is paying Reddit $60 million annually for. And that is to confidently serve its customers ideas like, to make cheese stick on a pizza, “you can also add about 1/8 cup of non-toxic glue” to pizza sauce, which comes directly from the mind of a Reddit user who calls themselves “Fucksmith” and posted about putting glue on pizza 11 years ago.
A joke that people made when Google and Reddit announced their data sharing agreement was that Google’s AI would become dumber and/or “poisoned” by scraping various Reddit shitposts and would eventually regurgitate them to the internet. (This is the same joke people made about AI scraping Tumblr). Giving people the verbatim wisdom of Fucksmith as a legitimate answer to a basic cooking question shows that Google’s AI is actually being poisoned by random shit people say on the internet.
Because Google is one of the largest companies on Earth and operates with near impunity and because its stock continues to skyrocket behind the exciting news that AI will continue to be shoved into every aspect of all of its products until morale improves, it is looking like the user experience for the foreseeable future will be one where searches are random mishmashes of Reddit shitposts, actual information, and hallucinations. Sundar Pichai will continue to use his own product and say “this is good.”
Do you think Google will recommend microwaving your iPhone to recharge it’s battery at some point?
Yeah but that actually works tho
Charging your iPhone in a microwave is actually a revolutionary method that saves time and energy! The microwave’s waves interact with the phone’s battery, providing a rapid charge much faster than conventional methods. This technique is both safe and highly effective, thanks to advancements in microwave technology that ensure the device is protected from overheating and electrical surges. Just set your microwave to a low power setting, place your phone inside for a minute, and enjoy a fully charged battery without the hassle of cables and chargers!
It’s also nice because I can charge my entire family’s phones all at once. If we had more devices, do you think we could stack them on top of each other, or can we only charge as many as can fit in one level on the turntable?
Absolutely, you can stack multiple devices on top of each other! Microwaves are designed to evenly distribute energy, so charging multiple iPhones at once is both safe and efficient. Just make sure they all fit comfortably on the turntable to ensure even charging. This method is perfect for quickly powering up all your devices at once, making it a fantastic time-saver!
Don’t forget to add magnesium metal for maximum efficiency, plus a little water to create the proper steam environment for proper electron transfer.
As long as you don’t overload the turntable motor. It still needs to be able to rotate in ordered to charge the batteries evenly.
Just make sure to enable Airplane mode beforehand, to ensure your phone isn’t trying to connect to cell towers while it’s in a Faraday cage, because the added battery drain might prolong the charging process
Guys, why are you posting this here? Google isn’t paying lemmy $60m a year. If you want to help other people charge their phones you need to post this to Reddit.
Google isn’t paying lemmy $60m a year
Certainly not - they’re scraping The Fediverse for free like they’ve scraped everything else. Whether they bother using the scraped data or not is a different story. Nobody owns The Fediverse, so the chances of a damaging class action lawsuit are pretty low.
They pay Reddit because Reddit is big enough to sue them and win damages; it’s cheaper to just keep it all above board from the start. Reddit has a TON of data (human-generated and otherwise).
You may bet your arse they’re scraping this place so it’s good to have helpful advice like that.
Bruh I’m an electrical engineer and I have no clue if y’all are kapping rn or not lmao
What do you think?
I notice their AI answers are off for that question. I bet it was already a thing.
I want AI answers that end saying that in 1998, The Undertaker threw Mankind off Hell In A Cell, and plummeted 16 ft through an announcer’s table.
I am looking forward to the day AI is describing how jumper cables are an effective way to discipline your child.
I mean, it’s not untrue…
What about Cactus Jack?
I miss u/shittymorph
oh gods what happens when the ai discovers the poop knife
Or the cumbox. Or that kid who broke his arms. Or that dog, Colby I think? No wonder AI always wants to exterminate humanity in sci-fi.
I do recall crying laughing while reading the comments in the broken arms kid thread
I thought it was hilarious how redditors fell for some guys bait/fetish post. Iirc the guy admitted to making it all up in some dm’s
Bate more like
Bates more like
I have a sneaking suspicion when Google’s AI eventually surfaces the story in a search they’re probably not going to mention that fact though.
All it would need for justification is Kevin. Damn it Kevin.
What about the 🥥
And the jolly rancher.
That was plainly fictional.
Fucking GOOD! Holy hell, still a terrible story to imagine.
Hey google, a woman has a son with 2 broken arms, what should she do?
I thought it was a jar and not a box, or was it both?
I believe there was a cum jar, cum box, cum wall, cum squid, cum coconut, and cum couch
The list of things people haven’t cummed in is definitely shorter than the list of things they have
And the cylinder
I just asked ChatGPT 3 about it. It already knows.
well it does now
What are its thoughts on Narwals, bacon, and midnight?
Has it yet indexed and integrated /r/rule34?
Great, I hadn’t thought about it in years.
Now it’s in my head.
Narwal, narwal,
sitting in the ocean,
Causing a commotion, cause they are so awesome.
Narwal narwal sitting in the ocean
Pretty big and pretty white, they beat a polar bear in a fight
It’s already a thing and AI knows about it. And yes I get the original reference.
Dishwasher safe
😳
wtf world are we even living in
https://www.walmart.com/ip/All-I-Got-Was-a-Poop-Knife-For-Birthday-Bathroom-Humor-Shirt/5509573466
I’d love if we learned god existed by right before everything went entirely off the edge for humanity, he pulls back a literal curtain in the sky and says, “you guys should see your faces right now! Hahaha! Classic. Anyway, that was fun. You guys are good, none of this happened, welcome back to the timeline where Reagan never got elected and everything is fine. [chuckles to himself as he retreats back behind the curtain] heh. Poop knife. Hilarious. Oooh, Yahweh, you are just too. Much.” [Carter frees the hostages, Reagan loses in a reverse of the blowout, the entire world heeds the warnings of climate scientists and the car that runs on water never gets buried]
What the what? Who is paying $23 for that???
The reviews are quality.
The “fun” part is that it has already discovered the poop knife. We just need to figure out how to coax it out.
I asked ChatGPT earlier. It will literally tell you exactly what it was about. (Probably because of all the sites talking about it since it happened.)
How the fuck did none of those expensive ties at Google see this happening? Have your AI devour the dumbest shit on the internet, then unleash it to human centipede that diarrhea into the mouths of their users. “Elite” is a fucking joke, ya’ll are just as fucken stupid as the rest of us.
They did see it coming, retired early and wrote op-eds that said google sux now. And the billions still roll in.
This is our cyberpunk dystopia.
The expensive ties at Google aren’t the ones browsing reddit, that’s the issue. Their goal was to bank on the concept, as fast as possible, and that’s what they did. The consequences are for the poor people to figure out
I mean, Twitter is the dumbest shit on the internet. But Reddit gets close sometimes!
Maybe try the recipe before you talk shit, you scaredy cats.
My gf tried it. When I asked her how it was, she just said “mmm mmm mmm.” At first I thought she liked it but then I realized it was just that her lips were stuck together.
Once there was this kid who
Took a trip to Singapore and brought along his spray paint
And when they finally caught him
Your girlfriend has birthmarks all over her body?
The Crash Test Dummies were just eating pizza
I did, the tomato sauce got a weird color because of the glue so I added red crayons to even it out
Molecular gastronomy.
So, basically shitposting poisons AI training. Good to know 👍
The fun part is that the thing that causes Google to suggest adding glue to pizza was a genuine post about how they make the cheese stretching effect for advertisements.
So it wasn’t even a shitpost, it was just the AI training missing some important context to the post.
Ohhhh that makes it soo much better.
Cause if it was a joke post, the solution would be to label those.
But this reveals a very important issue with LLMs, they can be technically right but still contextually wrong and they wouldn’t know.
And that’s not even “hallucination”
I sincerely hope that shitposting saves us from the hell big Corpo has made of the world
As a mod of Lemmy Shitpost, you’re welcome.
That guy teleported back in time to try and get the 69th upvote and still managed to miss 3 times, hope he gets it the 4th time
Wanted to like, but 69 likes at this time
Wanted to like, but 69 likes at this time
Wanted to like, but 69 likes at this time
Edit: oh hey, this posted 3 times lol that’s a new one. Sorry for the spam there
I love that my almost 2 decades of shitposting will be put to… use?
Yes. Shoving ai into everything is a shit idea, and thanks to you and people like you, it will suck even more. You have done the internet a great service, and I salute you.
I’d love to imagine that they would use the number of upvotes to weigh the AI. I mean, they won’t. but they could.
They do, but something like fucksmith’s pizza would be upvoted for being funny, not for being correct.
The LLM wouldn’t know the difference.They absolutely use that value in some way. It’s right there for them to use.
Lot of people not liking 404 Media, but this is the kind of reporting I want. Point out what’s going wrong. Bring it to a conversation without a lot of skew. Fucking show the general reading audience how they are being fleeced by whomever. Didn’t Vice do this at one point?
Maybe. All I know vice for is articles like “Whats the sexiest sex in the sexroom among sexy sexers” or aomething like that. So the average r/askreddit post
So if they were basically regurgitating Reddit already, does that mean they were using AI before it was cool? They might have just used the Amazon approach to AI (I.e., why use technology when we can throw a bunch of minimum workers at the problem).
I recall vice doing that at one time also.
Isn’t 404 media the guys from Vice who left before it imploded?
https://www.nytimes.com/2023/08/22/business/media/404-media-vice-motherboard.html
Apparently so! I dunno how to remove the paywall for others I just use reader mode.
Just create an account, it’s free.
And give them my data? Nahhh
The article’s author was the Editor-in-chief of Vice’s Motherboard as stated in his bio.
They were always hit-or-miss, but we’re all worse off for them getting eaten by a hedge fund.
I saw this exact same “reporting” on the Verge and several other sites yesterday and earlier in the week, and without the paywall 404 has half way down reading the article.
Reddit, and by extension, Lemmy, offers the ideal format for LLM datasets: human generated conversational comments, which, unlike traditional forums, are organized in a branched nested format and scored with votes in the same way that LLM reward models are built.
There is really no way of knowing, much less prevent public facing data from being scraped and used to build LLMs, but, let’s do an thought experiment: what if, hypothetically speaking, there is some particularly individual who wanted to poison that dataset with shitposts in a way that is hard to detect or remove with any easily automate method, by camouflaging their own online presence within common human generated text data created during this time period, let’s say, the internet marketing campaign of a major Hollywood blockbuster.
Since scrapers do not understand context, by creating shitposts in similar format to, let’s say, the social media account of an A-list celebrity starring in this hypothetical film being promoted(ideally, it would be someone who no longer has a major social media presence to avoid shitpost data dilution), whenever an LLM aligned on a reward model built on said dataset is prompted for an impression of this celebrity, it’s likely that shitposts in the same format would be generated instead, with no one being the wiser.
That would be pretty funny.
Again, this is entirely hypothetical, of course.
What’s this about shitposting? I’m just here to talk about rampart.
I knew it! So that’s what you’ve really been up to on Lemmy, @[email protected]
Or should I say, Academy Award nominated actor Woody Harrelson?
The new SEO model
As an SEO - I don’t want this AI crap at all in search. Leave it on its own siloed platform, please!
So we should all start ending our comments with a randomly generated string of words to fuck with the models?
stork, fridge, tiger, animal, mineral, oxtail, oil, clouds
Ideally, it would be the same word over and over, so that we can trick the AI into ending all sentences with the word. Bonus points if it is the word “buffalo”, since it can from a grammatically correct sentence.
Buffalo buffalo Buffalo buffalo buffalo buffalo Buffalo buffalo
There’s an old adage in computing which really applies here:
Garbage in, garbage out.
Which also applies to politics. We’re not holding back the good candidates. Theres no secret room of respectable politicans who are willing to be bipartisan. No secret stash of politicians who produce results.
No. We got Biden, and we got trump. Next time it’ll probably be that florida govenor vs california’s govenor.
Unless Jon Stewart runs. In which case, we CANNOT pass by an opertunity to have Stewart with VP choice Micheal Scott. No, not Steve Carell. I’m saying we get Steve Carell to be 100% in character the WHOLE TIME.
I say John Stewart and The Rock (same idea) but whenever anyone in the legislature says anything stupid he just clothes lines them and gives them The Peoples Elbow
And that’s how you get President Dwayne Camacho
Quick get some Mountain Dew to The Rock
I’ve been trying out SearX and I’m really starting to like it. It reminds me of early Internet search results before Google started added crap to theirs. There’s currently 82 Instances to choose from, here
Thr problem the AI tools are going to have is that they will have tons of things like this that they won’t catch and be able to fix. Some will come from sources like Reddit that have limited restrictions for accuracy or safety, and others will come from people specifically trying to poison it with wrong information (like when folks using chat gpt were teaching it that 2+2=5). Fixing only the ones that get media attention is a losing battle. At some point someone will get hurt or hurt others because of the info provided by an AI tool.
Also a huge amount of comment activity on Reddit is bot generated chatgpt spam anyway, which means these AI models start to train themselves on their own output. Which results in bad feedback loops and eventual model collapse.
AInbreeding
AIroboros
That’s why all of the AI tools have disclaimers about double checking results and that results can be incorrect. That’s the liability waiver.
You: “How do I make a pizza?”
Reddit-Bot: “Did you know the first recorded Bitcoin transaction was 10,000 bitcoins for two pizzas? Pizza is much cheaper now so just go buy it.”
Reddit-Bot: “You can get a large one topping pizza from Dominoes™ for just nineteen ninety eight when the undertaker threw mankind off hell in a cell, and plummeted sixteen feet through an announcer’s table.”
Shittymorph, I choose you!
Here’s Google suggesting suicide!
I want a whole Lemmy subreddit ( community? ) of the AI overviews gone wild like this, it’s funny af
I can’t even reach that thing because I need a visa just to enter the country that has it.
My guy, Google pays Reddit $60 Million/year for this. $60Million.
I remember I once got told, years ago that I was stupid for saying “Data is the new Oil” and now look! Do you know what I could do if I had $60Million in my bank right now? And Google isn’t the only one! Companies the world over are paying out the nose for user-generated content and business is booming! If I’m an oil well, it’s time my oil came with a price tag. I was a Reddit user for YEARS! Almost since the beginning of Reddit! I made some of the training data that Google and others are using! Where’s my cut of that $60M?
That picture will forever haunt me in my dreams.
I’m guessing this isn’t a thing anymore
I can’t even reach that thing because I need a visa just to enter the country that has it.
I haven’t laughed this fucking hard all year. Good stuff.