Maven Imported 1.12 Million Fediverse Posts

Sean Tilley@lemmy.world · 1 year ago

Maven Imported 1.12 Million Fediverse Posts

lunarul@lemmy.world · 1 year ago

I was confused why a package manager would need to import posts from a social network.

Why name a new product the same as a very popular existing product?

GBU_28@lemm.ee · 1 year ago

I mean maven is super bloated so it wouldn’t surprise me

threelonmusketeers@sh.itjust.works · 1 year ago

I was confused on what they were trying to accomplish, and even after reading the article I am still somewhat confused.

Instead, when a user posts something, the algorithm automatically reads the content and tags it with relevant interests so it shows up on those pages. Users can turn up the serendipity slider to branch out beyond their stated interests, and the algorithm running the platform connects users with related interests.

Perhaps I’m a minority, but I don’t see myself getting much utility out of this. I already know what my interests are, and don’t have much interest in growing them algorithmically. If a topic is really interesting, I’ll eventually find out about it via an actual human.

Zak@lemmy.world · 1 year ago

TikTok is really popular operating on essentially the same principle. I, for one want nothing to do with that.

Plopp@lemmy.world · 1 year ago

Instead, when a user posts something, the algorithm automatically reads the content and tags it with relevant interests so it shows up on those pages.

Motherfucker this is what hashtags are for.

Larry@lemmy.world · 1 year ago

Am I misunderstanding this, or did they just fuck up the integration so it’s one way with a plan to make it two ways after, and the AI alteration is just sentiment analysis on whatever they took?

Sean Tilley@lemmy.world · 1 year ago

They kind of fucked up everything in approaching this by not talking to the community and collecting feedback, making dumb assumptions in how the integration was supposed to work, leaking private posts, running everything through their AI system, and neglecting to represent the remote content as having came from anywhere else.

The other thing is that Maven’s whole concept is training an AI over and over again on the platform’s posts. Ostensibly, this could mean that a lot of Fediverse content ended up in the training data.

Grandwolf319@sh.itjust.works · 1 year ago

Genuine question, do instances not have a GPL license on their content? With that license, anyone can use all the data but only for open source software.

GamingChairModel@lemmy.world · 1 year ago

Instances don’t actually own the copyright to comments. The poster owns the copyright and licenses it to the instance. Which lets the instance use it, but not sublicense to others.

Spedwell@lemmy.world · 1 year ago

The current assumption made by these companies is that AI training is fair use, and is therefore legal regardless of license. There are still many ongoing court cases over this, but one case was already resolved in favor or the fair use position.

jackalope@lemmy.ml · 1 year ago

I don’t think you can use gpl for anything but code. Creative commons license would be more appropriate.