- cross-posted to:
- [email protected]
- cross-posted to:
- [email protected]
Maven, a new social network backed by OpenAI’s Sam Altman, found itself in a controversy today when it imported a huge amount of posts and profiles from the Fediverse, and then ran AI analysis to alter the content.
I was confused why a package manager would need to import posts from a social network.
Why name a new product the same as a very popular existing product?
I mean maven is super bloated so it wouldn’t surprise me
I was confused on what they were trying to accomplish, and even after reading the article I am still somewhat confused.
Instead, when a user posts something, the algorithm automatically reads the content and tags it with relevant interests so it shows up on those pages. Users can turn up the serendipity slider to branch out beyond their stated interests, and the algorithm running the platform connects users with related interests.
Perhaps I’m a minority, but I don’t see myself getting much utility out of this. I already know what my interests are, and don’t have much interest in growing them algorithmically. If a topic is really interesting, I’ll eventually find out about it via an actual human.
TikTok is really popular operating on essentially the same principle. I, for one want nothing to do with that.
Instead, when a user posts something, the algorithm automatically reads the content and tags it with relevant interests so it shows up on those pages.
Motherfucker this is what hashtags are for.
Am I misunderstanding this, or did they just fuck up the integration so it’s one way with a plan to make it two ways after, and the AI alteration is just sentiment analysis on whatever they took?
They kind of fucked up everything in approaching this by not talking to the community and collecting feedback, making dumb assumptions in how the integration was supposed to work, leaking private posts, running everything through their AI system, and neglecting to represent the remote content as having came from anywhere else.
The other thing is that Maven’s whole concept is training an AI over and over again on the platform’s posts. Ostensibly, this could mean that a lot of Fediverse content ended up in the training data.
Genuine question, do instances not have a GPL license on their content? With that license, anyone can use all the data but only for open source software.
Instances don’t actually own the copyright to comments. The poster owns the copyright and licenses it to the instance. Which lets the instance use it, but not sublicense to others.
The current assumption made by these companies is that AI training is fair use, and is therefore legal regardless of license. There are still many ongoing court cases over this, but one case was already resolved in favor or the fair use position.
I don’t think you can use gpl for anything but code. Creative commons license would be more appropriate.