The Internet Archive just lost its appeal over ebook lending

fossilesque@mander.xyz · edit-2 1 year ago

DrCake@lemmy.world · 1 year ago

So when’s the ruling against OpenAI and the like using the same copyrighted material to train their models

irotsoma@lemmy.world · 1 year ago

But OpenAI not being allowed to use the content for free means they are being prevented from making a profit, whereas the Internet Archive is giving away the stuff for free and taking away the right of the authors to profit. /s

Disclaimer: this is the argument that OpenAI is using currently, not my opinion.

norimee@lemmy.world · edit-2 1 year ago

Ah, I see you got that all wrong.

OpenAI uses that content to generate billions in profit on the backs of The People. The Internet Archive just does it for the good of The People.

We can’t have that. “Good for The People” is not how the economy works, pal. We need profit and exploitation for the world to work…

v_krishna@lemmy.ml · 1 year ago

OpenAI is burning billions of dollars not making profit.

Agret@lemmy.world · 1 year ago

Sounds like they are operating the same as all the other big tech companies then

buddascrayon@lemmy.world · 1 year ago

Wrong

https://futurism.com/the-byte/openai-copyrighted-material-parliament

v_krishna@lemmy.ml · 1 year ago

Eh? That article says nothing about their profit margins. Today they have something like $3.5B in ARR (not really, that’s annualized from their latest peak, in Feb they had like $2B ARR). Meanwhile they have operating costs over $7B. Meaning they are losing money hand over fist and not making a profit.

I’m not suggesting anything else, just that they are not profitable and personally I don’t see a road to profitability beyond subsidizing themselves with investment.

buddascrayon@lemmy.world · 1 year ago

It’s in the first bloody paragraph. 😮‍💨

OpenAI is begging the British Parliament to allow it to use copyrighted works because it’s supposedly “impossible” for the company to train its artificial intelligence models — and continue growing its multi-billion-dollar business — without them.

And if you follow the link the title of the article says it all:

OpenAI is set to see its valuation at $80 billion—making it the third most valuable startup in the world

v_krishna@lemmy.ml · 1 year ago

I take it you don’t understand how startups work?

OpenAI is not making any profit and is losing money hand over fist today. Valuation and raising investment rounds isn’t profit.

finitebanjo@lemmy.world · 1 year ago

I think you accidentally wrote “Open IA” instead of OpenAI. IA happens to be the initials of Internet Archive, so it was a little confusing.

norimee@lemmy.world · 1 year ago

I didn’t even realise. Thank you for pointing it out, I fixed it.

Anyolduser@lemmynsfw.com · 1 year ago

Hot on the heels of this one, I’d imagine.

iAmTheTot@sh.itjust.works · 1 year ago

Fat chance. Line must go up.

shrugs@lemmy.world · 1 year ago

So, let’s say we create an LLM that is fed all the copyrighted data, and we design it so that it recalls the originals when asked?! Does that count as piracy or as the kind of legal shenanigans OpenAI is doing?

wizblizz@lemmy.world · 1 year ago

Aaaaaany minute now.

PriorityMotif@lemmy.world · 1 year ago

deleted by creator

Gsus4@mander.xyz · edit-2 1 year ago

The matter is not LLMs reproducing what they have learned, it is that they didn’t pay for the books they read, like people are supposed to do legally.

This is not about free use, this is about free access, which at the scale of an individual reading books is marketed as “piracy”…at the scale of reading all books known to man…it’s omnipiracy?

We need some kind of deal where commercial LLMs either pay rent into a fund that distributes it among creators or remain nonprofit, which is never gonna happen, because it’ll be a bummer for all the grifters rushing into that industry.

PriorityMotif@lemmy.world · 1 year ago

I think we need to re-examine what copyright should be. There’s nothing inherently immoral about “piracy” when the original creator gets almost nothing for their work after the initial release.

barsoap@lemm.ee · 1 year ago

it is that they didn’t pay for the books they read, like people are supposed to do legally.

If I can read a book from a library, why shouldn’t OpenAI or anybody else?

…but yes from what I’ve heard they (or whoever, don’t remember) actually trained on libgen. OpenAI can be scummy without the general process of feeding AI books you only have read access to being scummy.

General_Effort@lemmy.world · 1 year ago

Meta is defending itself because they trained on Books3, which contained all of Bibliotik. https://en.wikipedia.org/wiki/The_Pile_(dataset)

Gsus4@mander.xyz · edit-2 1 year ago

This is not like reading a book from a library…unless you want to force the LLM to train on only one book per day and keep no copies after that day.

barsoap@lemm.ee · 1 year ago

They don’t keep copies, and why does learning speed matter? Why one day? Does it count if I skim through a book?

Gsus4@mander.xyz · 1 year ago

deleted by creator

index@sh.itjust.works · 1 year ago

stop asking questions and go back to work