Its a bit old, but I just learned it via the retro-dodo article here: https://retrododo.com/google-is-killing-retro-dodo/
Is it just me or are 60 million a ridiculously small price for that whole dataset?
I wonder if Google’s unlimited legal budget plays a role. Not a lawyer, so probably way off here…
But, for example, reddit’s success in part depends on Google ingesting their data — reddit shows up in Google searches all the time, which can only happen if Google uses reddit’s content. So reddit telling Google “you can’t use our content” doesn’t work, and they need to say something like, “you can use our content for search results but you can’t consume it as training data.”
This is a pretty straightforward statement/request/demand, but one could imagine Google lawyers maliciously complying and throwing their hands up dramatically, claiming “well we use some amount of AI in our search results, so if we can’t use your content for AI training then we can’t risk using it for search results.” Which would, I imagine, really, really hurt reddit (no Google results would be catastrophic I suspect).
So, perhaps the “low” 60M figure is just Google using their leverage.
Or not. As a random person on the Internet, I can say I’m probably not contributing anything meaningful here…
How quickly you forget that half of it is just “I also choose this guy’s wife” and “the narwhal bacon’s at midnight”
Considering it’s all full of Nazis and bots, and if you get to filter all of them out you’re left with reposts and low quality memes followed by comments that represent the hostile side of each of us… I’d say anything over $5 is a good deal for spez.
Now, I hope Google uses this data exclusively for detecting inappropriate answers. Can you imagine it giving answers based on the endless threads i of " I’m not your mate, bro; I’m not your bro, dude…".
It’s more than they were making from third party apps, hence the ridiculous API fees.
Can’t wait to see an AI chatbot in my Google searches that behaves like a typical redditor.
This.
I mean one of the most popular search types on Google is <topic + Reddit> so not much would change
Just wait till the LLM starts “singing” randomly to you.
– Hey Google/reddit, what does xxxxxx mean?
–Wtf is people so lazy, Google it yourself it’s only 5 seconds!
–But but, you are Google, are you not?
–Buahaha , haha!
deleted by creator
Steve Huffman looks increasingly douchier and shittier with every passing photo.
What a damn chode. Fuck that guy.
Can someone point me the way of that bot or whatever that changes all your old Reddit posts before deleting them? I thought I had it saved somewhere but I can’t find it now and have no idea what it’s called.
They keep copies of posts because people who mass edited their posts saw them reverted or have people reply still as if they were not edited.
Plus they can easily just detect mass edits, and ship the state prior to that event.
I had read that with some people, is was a delay from their server instance between read/write and in the end the changes did end up sticking, but I don’t know if that was true. A lot of people were mass editing at the same time, and since editing isn’t something that happens super frequently, it might have less priority in the stack and caused backups.
They change it on their website but the data that’s collected and sold isn’t changed.
It still devalues their google search though but also makes it harder to scrap data for free and ups the value of what they are selling.
For sure. They definitely have change records for everything. It would be borderline negligent if they didn’t.
I hope AI sais fuck Spez a lot:-)
You can be sure the little fucker hired people to filter out that sort of stuff from the data
AI be like “stfu regard”
AI be like there things are over they’re
I deleted my comment history after the API exodus. I’m sure they could dig it up if they wanted but at least they’ll have to click like 3 more buttons if they want to train AI on my nonsense.
Before:
SELECT * FROM `comments` WHERE is_deleted=0;
After:
SELECT * FROM `comments`;
60m? Ms got a steal no wonder Reddit can’t monetize
is there a way to mass delete my old content? the service i used in the past doesn’t seem to have worked. i recently got a reply from a 6 year-old post from someone saying they got there on google.
My understanding is that the mass delete you did probably had worked, but reddit rolled back your deletions. I heard it happened to a lot of mass deleters after the lemmy exodus.
Can we still mass edit our previous comments with random stuff, a little bit at a time to avoid detection? Poison the data, yada yada.
Is worse nothing gotten really deleted admins admitted in like 2018 that they can see deleted posts. I think even some mods can. The access they give to Google is to the backend they can see EVERYTHING.
I think I’m gonna be sick. so all the stuff I wrote, it’s just THERE? what the fuck do i do? what about private info that I dont want on a public fucking search engine?? I’ve had that account since I was a kid, there’s a lot of shit I regret posting, what the FUCK!
Google cached it maybe?
Minimum royalty laws should exist.
That’s not great news when weighed against my desire to watch reddit crash and burn.
All we can do is make something better, reddit will do their thing and we will do ours.
I wonder how they will decide how to dole that out to their contributors… oh, wait.
*your posts
At first, I thought he was holding a big pile of cash in the thumbnail.
Cunt