OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling’s Harry Potter series::A new research paper laid out ways in which AI developers should try and avoid showing LLMs have been trained on copyrighted material.

  • scarabic@lemmy.world
    link
    fedilink
    English
    arrow-up
    20
    arrow-down
    13
    ·
    1 year ago

    One of the first things I ever did with ChatGPT was ask it to write some Harry Potter fan fiction. It wrote a short story about Ron and Harry getting into trouble. I never said the word McGonagal and yet she appeared in the story.

    So yeah, case closed. They are full of shit.

    • PraiseTheSoup@lemm.ee
      link
      fedilink
      English
      arrow-up
      25
      arrow-down
      4
      ·
      1 year ago

      There is enough non-copywrited Harry Potter fan fiction out there that it would not need to be trained on the actual books to know all the characters. While I agree they are full of shit, your anecdote proves nothing.

      • Cosmic Cleric@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        12
        ·
        1 year ago

        While I agree they are full of shit, your anecdote proves nothing.

        Why? Because you say so?

        He brings up a valid point, it seems transformative.

        • LittleLordLimerick@lemm.ee
          link
          fedilink
          English
          arrow-up
          12
          arrow-down
          1
          ·
          1 year ago

          The anecdote proves nothing because the model could potentially have known of the McGonagal character without ever being trained on the books, since that character appears in a lot of fan fiction. So their point is invalid and their anecdote proves nothing.

          • Cosmic Cleric@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            arrow-down
            1
            ·
            1 year ago

            I was questioning how much non- copyrightable material was available to train an AI on.

            It’s not a brain dead question just because you may disagree with it.

            • GroggyGuava@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              ·
              1 year ago

              Which he literally answers in the comment you questioned him on. You asked him something after he explained what you then asked.

              That’s braindead, and not because I “disagree” with your question, whatever that means.

              • Cosmic Cleric@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                arrow-down
                1
                ·
                1 year ago

                I wasn’t agreeing with him and I was asking him to back up what he said. But you carry on, Internet Warrior.