Poisoned AI went rogue during training and couldn't be taught to behave again in 'legitimately scary' study

L4sBot@lemmy.world · 2 years ago

JustMy2c@lemm.ee · 2 years ago

So you’re saying that “Inflammatory data” isn’t a reference to reddit? :D

Daxtron2@startrek.website · 2 years ago

Not inherently, I’m sure that’s part of it but it’s really everywhere. Even here on Lemmy I’ve run into nasty folk

JustMy2c@lemm.ee · 2 years ago

True but it’s reddit that’s served as a base for most models…

Daxtron2@startrek.website · 2 years ago

Not just reddit, LAION is a huge dataset

JustMy2c@lemm.ee · 2 years ago

Obviously but reddit is in the goldilocks zone where you get coherent intelligent stuff and humor and facts.

But it’s still toxic for an Ai.

Daxtron2@startrek.website · 2 years ago

Saying it served as the base for most models is just objectively incorrect though

JustMy2c@lemm.ee · 2 years ago

Correcto but maybe it DOES apply to most asked questions, if you know where I’m going with that

Chris@lemmy.world · 2 years ago

No, LLM is the AI, OP is saying if you train it with hate it’s gonna spit out hate

JustMy2c@lemm.ee · 2 years ago

And I’m saying that reddit data is sublime for Ai. And specifically that it’s invested with toxicity