• IchNichtenLichten@lemmy.world
        link
        fedilink
        English
        arrow-up
        15
        ·
        10 months ago

        You’ll get your refund eventually but first it will try and gaslight you that Air Canada is a woke mind virus before calling you an asshole and then stalking you.

        • pdxfed@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          10 months ago

          “instead of the $3.50 refund, I’m also authorized to offer you some June 2025 $350 GME calls.”

    • Lvxferre@mander.xyz
      link
      fedilink
      English
      arrow-up
      5
      ·
      10 months ago

      A LLM that behaves like a typical Redditor? // What possible use is that?

      • [You] “Chatbot, please tell me which pokemon types are strong against Fairy.”
      • [Le Lebbit Moronbot] “I’m not sure if I understand, you calling me a chatbot? I’m so confused lol”
      • [You] “Moronbot, please tell me which pokemon types are strong against Fairy.”
      • [LLM] “Actually, you should be spelling it “Pokémon” lol”
      • [You] “Moronbot, which types are strong against Fairy?”
      • [LLM] “I assume you talking about fairies. Fairies are from mythology lmao”
      • [You] “Did people really waste water and electricity for this trash?”
      • [LLM] “Waaah, you’re toxic!!111one”
  • garibaldi_biscuit@lemmy.world
    link
    fedilink
    English
    arrow-up
    80
    ·
    10 months ago

    This is what the 3rd party access to API was really all about.

    When API access was allowed , all reddit content was effectively free: They needed to ban 3rd party apps so they could sell the accumulated content. I expect using content to train AI also factors into it.

  • Tiger Jerusalem@lemmy.world
    link
    fedilink
    English
    arrow-up
    82
    arrow-down
    4
    ·
    edit-2
    10 months ago

    Reddit is a trove of user built content under the guise of community. What Spez did was to say “thanks for all the free work, suckers!”, put a price sticker on it, and laughed all the way to the bank.

    And this is why I’m not active on any Internet community anymore. Nevermind, I guess I just can’t help myself…

    • nodsocket@lemmy.world
      link
      fedilink
      English
      arrow-up
      32
      arrow-down
      1
      ·
      10 months ago

      And this is why I’m not active on any Internet community anymore,

      you typed.

      • Tiger Jerusalem@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        ·
        10 months ago

        Active as in “creating meaningful contributions and contributing to the overall knowledge base”. I still shit post from time to time.

        • pewter@lemmy.world
          link
          fedilink
          English
          arrow-up
          6
          ·
          10 months ago

          This is going to be a really weird thing to argue, but I just casually read through a bunch of your comments and they seem like meaningful contributions.

      • xorollo@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        ·
        10 months ago

        Somebody asked chat GPT to appear to be a normal internet user to populate the comments section to manufacture content for normal Internet users to respond to so that they can continue building up their training models.

        • Crack0n7uesday@lemmy.world
          link
          fedilink
          English
          arrow-up
          6
          ·
          10 months ago

          Some 4chan users created a backup bot that auto saves every few hours, so if reddit didn’t do it already, 4chan has been doing it for a while. The bot was originally made for 4chan but repurposed for other websites, reddit included.

        • Dozzi92@lemmy.world
          link
          fedilink
          English
          arrow-up
          5
          ·
          10 months ago

          Yeah, it’s all too late. Shit, PRISM was 2007, so there’s a copy of everything somewhere. Obviously different ends.

          • Ilgaz@lemm.ee
            link
            fedilink
            English
            arrow-up
            3
            ·
            10 months ago

            Spez like people are even capable of leeching archive.org and still sell the data which was archived for good intentions.

  • Verserk@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    64
    ·
    10 months ago

    Considering some of the very wrong and upvoted domain specific knowledge I’ve seen on Reddit over the years I’m not sure the training data is going to be useful for much beyond what every other model can do.

    • 【J】【u】【s】【t】【Z】@lemmy.world
      link
      fedilink
      English
      arrow-up
      41
      ·
      10 months ago

      The legal advice in /r/legaladvice was some of the worst garbage I’ve ever seen. I have zero doubt numerous had bad outcomes, at best wasting money and time, at worst spending years in jail because of things that sub told them to say and do. Zero doubt.

      • evatronic@lemm.ee
        link
        fedilink
        English
        arrow-up
        16
        ·
        10 months ago

        That sub was mostly cops just repeating their own bad interpretation of the law. Terrible.

    • aStonedSanta@lemm.ee
      link
      fedilink
      English
      arrow-up
      14
      ·
      10 months ago

      lol subreddits with troll names like trees vs marijuana enthusiasts. Good fun. John cena has one also but can’t recall which subreddit is actually about John cena though.

    • peopleproblems@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      10 months ago

      I can only assume they are training some specific model for something appearing more human like.

      As useless as that will be considering how fucking wildly different we type

  • Voyajer@lemmy.world
    link
    fedilink
    English
    arrow-up
    46
    arrow-down
    1
    ·
    10 months ago

    This is why I don’t blame anyone for editing/deleting their post history on reddit.

    • SurRoulettes@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      10 months ago

      I wouldn’t be surprised if comments become their intellectual property through some terms of services bullcrap

    • bcron@lemmy.world
      link
      fedilink
      English
      arrow-up
      6
      ·
      10 months ago

      It’s gonna be trained on everything, even the stuff from 2009, so I’m expecting less of that and more random ‘my fedora chortles intensify’ word salad

  • NutWrench@lemmy.world
    link
    fedilink
    English
    arrow-up
    34
    arrow-down
    3
    ·
    10 months ago

    Reddit is all bots, porn, ads and political shit posts. Good luck getting any useful training content out of that.

    • ladicius@lemmy.world
      link
      fedilink
      English
      arrow-up
      15
      arrow-down
      1
      ·
      10 months ago

      Maybe that’s the point? Training the AI to produce the blabbering bullshit that’s preferred in social media?

    • PoliticalAgitator@lemmy.world
      link
      fedilink
      English
      arrow-up
      8
      ·
      10 months ago

      They don’t care if the AI produced is useful, they just want to milk as much money from their content as they can.

      The API changes were almost certainly just the groundwork for this and I called it at the time. The ridiculous pricing model for API access is because it’s aimed at the hottest tech companies, not third party app developers.

      The enshittification continues because it’s what neoliberalism demands. They’ll sell your content and the data they have about you and still show you ads, because that’s the most profitable. Ethics and product quality don’t even enter into it.

      • Ilgaz@lemm.ee
        link
        fedilink
        English
        arrow-up
        1
        ·
        10 months ago

        Liberal market gives end users choice. If they don’t choose, they get the consequences.

        This is more like people choosing Trump like types and complaining. Alternative exists, choose it.

        • PoliticalAgitator@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          1
          ·
          edit-2
          10 months ago

          “The free market can fix it” is just another neoliberal lie, pushed precisely because it doesn’t work. Rather than holding corporations accountable, it blames the population instead.

          The reality is that boycotting businesses isn’t always an option and when it is, it’s usually a luxury. Very few products are domestically and/or ethically produced and when they are, they’re extremely expensive, especially for people being fucked out of every cent by their bosses, landlords and utilities.

          It’s why the most hated companies in the world continue to bring in record profits.

          Regulations are the real answer, which is why neoliberals oppose them.

          • Ilgaz@lemm.ee
            link
            fedilink
            English
            arrow-up
            1
            arrow-down
            4
            ·
            edit-2
            10 months ago

            I really don’t care about people who behave like they are living in North Korea or who wants a North Korean World to live in.

            Even Digg people could say “No, F you” to Digg superstar owners. It is just a damn URL to type.

    • Queen HawlSera@lemm.ee
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      2
      ·
      10 months ago

      I wish it would die, because honestly some of the porn was great and Lemmy seems to be the one place on the net that doesn’t specifically ban porn, yet has none of it anyway.

      I miss bodyswap and part tf captions…

  • ozoned@lemmy.world
    link
    fedilink
    English
    arrow-up
    28
    ·
    10 months ago

    “Reddit has given access to YOUR conversations and posts to AI companies.”. FTFY

    These were created by people, for peoole, and I will ALWAYS disagree that this data is Reddit’s or any other platforms.

    Don’t forget your direct messages aren’t end to end encrypted on Reddit, so now AI will be trained on your craziest “private” conversations

    • DocMcStuffin@lemmy.world
      link
      fedilink
      English
      arrow-up
      6
      ·
      10 months ago

      There’s one good news. Reddit didn’t want to pay to move all the old DMs to the new chat infrastructure. So they deleted them.

      • hdnsmbt@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        10 months ago

        Pretty sure they just didn’t migrate to the new data structure and didn’t actually delete the raw data. They’re effectively deleted for users but not for Reddit.

    • butterflyattack@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      10 months ago

      now AI will be trained on your craziest “private” conversations

      I have no idea what horrible thing this will do to an LLM but I’m kind of curious.

  • Bobmighty@lemmy.world
    cake
    link
    fedilink
    English
    arrow-up
    25
    ·
    10 months ago

    With reddits severe bot problem, it’ll be like training on unfiltered sewage. Garbage in, garbage out.

  • SVcrossDO@lemmy.world
    link
    fedilink
    English
    arrow-up
    20
    ·
    10 months ago

    Damn it. I haven’t deleted my account due to how many people I’ve supported and helped, I stopped using it while ago. It seems I’ll have to.

  • Yokozuna@lemmy.world
    link
    fedilink
    English
    arrow-up
    20
    ·
    10 months ago

    Good thing I scrubbed all of my posts and comments that I could. Fuck that site, straight up and down.

  • aidan@lemmy.world
    link
    fedilink
    English
    arrow-up
    15
    ·
    10 months ago

    *laughs villainously* This is all going to plan, now there will be some chatbot spewing my insane beliefs