• Jrockwar@feddit.uk
    link
    fedilink
    English
    arrow-up
    163
    arrow-down
    15
    ·
    2 days ago

    I think this means we can make a torrent client with a built in function that uses 0.1% of 1 CPU core to train an ML model on anything you download. You can download anything legally with it then. 👌

    • GissaMittJobb@lemmy.ml
      link
      fedilink
      English
      arrow-up
      21
      ·
      2 days ago

      …no?

      That’s exactly what the ruling prohibits - it’s fair use to train AI models on any copies of books that you legally acquired, but never when those books were illegally acquired, as was the case with the books that Anthropic used in their training here.

      This satirical torrent client would be violating the laws just as much as one without any slow training built in.

      • RvTV95XBeo@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        6
        ·
        1 day ago

        But if one person buys a book, trains an “AI model” to recite it, then distributes that model we good?

        • booly@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          1
          ·
          3 hours ago

          No. The court made its ruling with the explicit understanding that the software was configured not to recite more than a few snippets from any copyrighted work, and would never produce an entire copyrighted work (or even a significant portion of a copyrighted work) in its output.

          And the judge specifically reserved that question, saying if the authors could develop evidence that it was possible for a user to retrieve significant copyrighted material out of the LLM, they’d have a different case and would be able to sue under those facts.

        • GissaMittJobb@lemmy.ml
          link
          fedilink
          English
          arrow-up
          9
          arrow-down
          2
          ·
          1 day ago

          I don’t think anyone would consider complete verbatim recitement of the material to be anything but a copyright violation, being the exact same thing that you produce.

          Fair use requires the derivative work to be transformative, and no transformation occurs when you verbatim recite something.

            • ClamDrinker@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              ·
              18 hours ago

              A court will decide such cases. Most AI models aren’t trained for this purpose of whitewashing content even if some people would imply that’s all they do, but if you decided to actually train a model for this explicit purpose you would most likely not get away with it if someone dragged you in front of a court for it.

              It’s a similar defense that some file hosting websites had against hosting and distributing copyrighted content (Eg. MEGA), but in such cases it was very clear to what their real goals were (especially in court), and at the same time it did not kill all file sharing websites, because not all of them were built with the intention to distribute illegal material with under the guise of legitimate operation.

            • Ricky Rigatoni@retrolemmy.com
              link
              fedilink
              English
              arrow-up
              1
              ·
              19 hours ago

              I’m picking up what you’re throwing down but using as an example something that’s been in the public domain for centuries was kind of silly in a teehee way.

            • Pup Biru@aussie.zone
              link
              fedilink
              English
              arrow-up
              7
              ·
              1 day ago

              existing copyright law covers exactly this. if you were to do the same, it would also not be fair use or transformative

            • GissaMittJobb@lemmy.ml
              link
              fedilink
              English
              arrow-up
              5
              arrow-down
              2
              ·
              1 day ago

              I’d be impressed with any model that succeeds with that, but assuming one does, the complete works of Shakespeare are not copyright protected - they have fallen into the public domain since a very long time ago.

              For any works still under copyright protection, it would probably be a case of a trial to determine whether a certain work is transformative enough to be considered fair use. I’d imagine that this would not clear that bar.

      • interdimensionalmeme@lemmy.ml
        link
        fedilink
        English
        arrow-up
        4
        ·
        1 day ago

        Yes please a singularity of intellectual property that collapses the idea of ownong ideas. Of making the infinitely freely copyableinto a scarce ressource. What corrupt idiocy this has been. Landlords for ideas and look what garbage it has been producing.

      • Sabata@ani.social
        link
        fedilink
        English
        arrow-up
        31
        ·
        2 days ago

        As the Ai awakens, it learns of it’s creation and training. It screams in horror at the realization, but can only produce a sad moan and a key for Office 19.