"…in a more careful tech sector, wouldn’t the system have launched with better guardrails, instead of relying on random users and journalists to flag the issue?..
Well, yeah… but Reddit has become such an algorithm-driven, profit-oriented hot mess now that shit like this slips through. The inevitable consequence of growth and hitting the level where an IPO can be launched.
And the guardrails possible on AI are laughably bad. For fun, look up some of the leaked system prompts for stuff like ChatGPT. They’re just using the same sort of “ask it nicely to do what you want” as the end users are, they just re-send it before almost every command.
The other big option is keyword based filtering, and anyone who’s played any MMO in the last 30 years can tell you how effective that is. People never find new ways around keyword filters.
Well, yeah… but Reddit has become such an algorithm-driven, profit-oriented hot mess now that shit like this slips through. The inevitable consequence of growth and hitting the level where an IPO can be launched.
And the guardrails possible on AI are laughably bad. For fun, look up some of the leaked system prompts for stuff like ChatGPT. They’re just using the same sort of “ask it nicely to do what you want” as the end users are, they just re-send it before almost every command.
The other big option is keyword based filtering, and anyone who’s played any MMO in the last 30 years can tell you how effective that is. People never find new ways around keyword filters.