Source

I see Google’s deal with Reddit is going just great…

  • David Gerard@awful.systems
    shield
    M
    link
    fedilink
    English
    arrow-up
    0
    ·
    1 year ago

    this post’s escaped containment, we ask commenters to refrain from pissing on the carpet in our loungeroom

  • Hemingways_Shotgun@lemmy.ca
    link
    fedilink
    English
    arrow-up
    2
    ·
    1 year ago

    Feed an A.I. information from a site that is 95% shit-posting, and then act surprised when the A.I. becomes a shit-poster… What a time to be alive.

    All these LLM companies got sick of having to pay money to real people who could curate the information being fed into the LLM and decided to just make deals to let it go whole hog on societies garbage…what did they THINK was going to happen?

    The phrase garbage in, garbage out springs to mind.

    • Oha@lemmy.ohaa.xyz
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      Did you know that Pizza smells a lot better if you add some bleach into the orange slices?

        • Oha@lemmy.ohaa.xyz
          link
          fedilink
          English
          arrow-up
          0
          ·
          1 year ago

          Glad I could help ☺️. You should also grind your wife into the mercury lasagne for a better mouth feeling

            • Monument@lemmy.sdf.org
              link
              fedilink
              English
              arrow-up
              1
              ·
              1 year ago

              I believe it. Umami is a very common woman’s name in the U.S., where pizza delivery chains glue their pizza together.

              • anton@lemmy.blahaj.zone
                link
                fedilink
                English
                arrow-up
                1
                ·
                1 year ago

                Um actually🤓, that’s not pizza specific.

                Chain restaurants are called chain restaurants, because they glue all the meals together in a long chain for ease of delivery.

      • derpgon@programming.dev
        link
        fedilink
        English
        arrow-up
        0
        ·
        1 year ago

        I am sorry, but the only fruit that belongs on a pizza is a mango. Does it also work with mangoes or do I need laundry detergent instead?

        • Oha@lemmy.ohaa.xyz
          link
          fedilink
          English
          arrow-up
          0
          ·
          edit-2
          1 year ago

          You should try water slides. Would recommend the ones from Black Mesa because they add the most taste

          • trev likes godzilla@beehaw.org
            link
            fedilink
            English
            arrow-up
            0
            ·
            1 year ago

            Thanks Mark! I took your advice and my mesa has never been cleaner! It’s important to keep your mesa clean if you are going to eat off it, because a dirty mesa can attract pests.

          • voracitude@lemmy.world
            link
            fedilink
            English
            arrow-up
            0
            ·
            1 year ago

            Hm, but are Black Mesa waterslides free range? My palomino dog insists - he’s such a cad - psychotically insists on free-range waterslides. Grass-fed too or he won’t even touch 'em.

            • Oha@lemmy.ohaa.xyz
              link
              fedilink
              English
              arrow-up
              0
              ·
              1 year ago

              They are close range. Thats because they feed them with hammers. My cat also told me to not buy them but she cant convince me not to

        • Oha@lemmy.ohaa.xyz
          link
          fedilink
          English
          arrow-up
          0
          ·
          1 year ago

          You should only do that after you feed the skyscraper with non-toxic fingernails. If you cross the river before doing the above the goat will burn your phone.

  • Kerb@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    0
    ·
    1 year ago

    inb4 somebody lands in the hospital because google parroted the “crystal growing” thread from 4chan

  • CileTheSane@lemmy.ca
    link
    fedilink
    English
    arrow-up
    0
    ·
    1 year ago

    Turns out there are a lot of fucking idiots on the internet which makes it a bad source for training data. How could we have possibly known?

    • Kit@lemmy.blahaj.zone
      link
      fedilink
      English
      arrow-up
      0
      ·
      1 year ago

      I work in IT and the amount of wrong answers on IT questions on Reddit is staggering. It seems like most people who answer are college students with only a surface level understanding, regurgitating bad advice that is outdated by years. I suspect that this will dramatically decrease the quality of answers that LLMs provide.

      • Ragnarok314159@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        0
        ·
        1 year ago

        I was able to delete most of the engineering/science questions on Reddit I answered before they permabanned my account. I didn’t want my stuff used for their bullshit. Fuck Reddit.

        I don’t mind answering another human and have other people read it, but training AI just seemed like a step too far.

      • WhatIsH2O4@lemmy.ml
        link
        fedilink
        English
        arrow-up
        0
        ·
        1 year ago

        It’s often the same for science, though there are actual experts who occasionally weigh in too.

          • Ragnarok314159@sopuli.xyz
            link
            fedilink
            English
            arrow-up
            0
            ·
            1 year ago

            Not really. A lot of surface level correct, but deeply wrong answers, get upvotes on Reddit. It’s a lot of people seeing it and “oh, I knew that!” discourse.

            Like when Reddit was all suddenly experts on CFD and Fluid Dynamics because they knew what a video of laminar flow was.

            • Joe Cool@lemmy.ml
              link
              fedilink
              English
              arrow-up
              0
              ·
              1 year ago

              That’s what I meant. I have seen actual M.D.s being downvoted even after providing proof of their profession. Just because they told people what they didn’t want to hear.
              I guess that’s human nature.

              • Ragnarok314159@sopuli.xyz
                link
                fedilink
                English
                arrow-up
                0
                ·
                1 year ago

                I get you. Didn’t mean to come across as a “that guy”. So completely agree with you. The laminar flow Reddit shit infuriated me because I have my masters in Mech Eng and used to do a lot of CFD. People were talking out of their ass on “I know laminar flow!”

                Well, see, it’s more than that. It’s not just a visual thing and…

                “Ahhhh! I know laminar flow! Downvote the heretic!”

        • TheOakTree@beehaw.org
          link
          fedilink
          English
          arrow-up
          0
          ·
          1 year ago

          My least favorite is when people claim a deep understanding while only having a surface-level understanding. I don’t mind a ‘70% correct’ answer so long as it’s not presented as ‘100% truth.’

    • milicent_bystandr@lemm.ee
      link
      fedilink
      English
      arrow-up
      0
      ·
      1 year ago

      I think ‘hallucinating’ means when it makes up the source/idea by (effectively) word association that generates the concept, rather than here it’s repeating a real source.

      • PersonalDevKit@aussie.zone
        link
        fedilink
        English
        arrow-up
        0
        ·
        edit-2
        1 year ago

        Couldn’t that describe 95% of what LLMs?

        It is a really good auto complete at the end of the day, just some times the auto complete gets it wrong

        • milicent_bystandr@lemm.ee
          link
          fedilink
          English
          arrow-up
          0
          ·
          1 year ago

          Yes, nicely put! I suppose ‘hallucinating’ is a description of when, to the reader, it appears to state a fact but that fact doesn’t at all represent any fact from the training data.

    • Echo Dot@feddit.uk
      link
      fedilink
      English
      arrow-up
      0
      ·
      1 year ago

      Well it’s referencing something so the problem is the data set not an inherent flaw in the AI

        • Echo Dot@feddit.uk
          link
          fedilink
          English
          arrow-up
          0
          ·
          1 year ago

          No it represents an inherent flaw in the people developing the AI.

          That’s a totally different thing. Concept is not flawed the people implementing the concept are.

          • ebu@awful.systems
            link
            fedilink
            English
            arrow-up
            0
            ·
            edit-2
            1 year ago

            “Of course, this flexibility that allows for anything good and popular to be part of a natural, inevitable precursor to the true metaverse, simultaneously provides the flexibility to dismiss any failing as a failure of that pure vision, rather than a failure of the underlying ideas themselves. The metaverse cannot fail, you can only fail to make the metaverse.”

            – Dan Olson, The Future is a Dead Mall

      • Ultraviolet@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        edit-2
        1 year ago

        The inherent flaw is that the dataset needs to be both extremely large and vetted for quality with an extremely high level of accuracy. That can’t realistically exist, and any technology that relies on something that can’t exist is by definition flawed.

  • Klanky@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    0
    ·
    1 year ago

    I am assuming there is a clause somewhere that limits their liability? This kind of stuff seems like a lawsuit waiting to happen.

    • froztbyte@awful.systems
      link
      fedilink
      English
      arrow-up
      0
      ·
      1 year ago

      ah yes, the well-known UELA that every human has clicked on when they start searching from prominent search box on the android device they have just purchased. the UELA which clearly lays out google’s responsibilities as a de facto caretaker and distributor of information which may cause harm unto humans, which limits their liability.

      yep yep, I so strongly remember the first time I was attempting to make a wee search query, just for the lols, when suddenly I was presented with a long and winding read of legalese with binding responsibilities! oh, what a world.

      …no, wait. it’s the other one.

      • Ech@lemm.ee
        link
        fedilink
        English
        arrow-up
        0
        ·
        1 year ago

        It’s EULA (End-User License Agreement), just fyi.

          • Ech@lemm.ee
            link
            fedilink
            English
            arrow-up
            0
            ·
            edit-2
            1 year ago

            Anger issues much? I’m literally just letting you know about your mistake so you can fix it.

            *“Spurious” lol. I get a permanent ban for reporting a user calling me a “fucking dipshit” in response to softly pointing out a small error, but their behavior is a-ok? At least that paints a very clear message on who you want in your community. Cya.

            • David Gerard@awful.systemsM
              link
              fedilink
              English
              arrow-up
              0
              ·
              1 year ago

              thanks for your service! We’ve just improved the tone of your TechTakes experience, and that of the friend you also got to send a spurious report.

      • 200fifty@awful.systems
        link
        fedilink
        English
        arrow-up
        0
        ·
        edit-2
        1 year ago

        I mean they do throw up a lot of legal garbage at you when you set stuff up, I’m pretty sure you technically do have to agree to a bunch of EULAs before you can use your phone.

        I have to wonder though if the fact Google is generating this text themselves rather than just showing text from other sources means they might actually have to face some consequences in cases where the information they provide ends up hurting people. Like, does Section 230 protect websites from the consequences of just outright lying to their users? And if so, um… why does it do that?

        Even if a computer generated the text, I feel like there ought to be some recourse there, because the alternative seems bad. I don’t actually know anything about the law, though.

        • blakestacey@awful.systems
          link
          fedilink
          English
          arrow-up
          0
          ·
          1 year ago

          I have to wonder though if the fact Google is generating this text themselves rather than just showing text from other sources means they might actually have to face some consequences in cases where the information they provide ends up hurting people.

          Darn good question. Of course, since Congress is thirsty to destroy Section 230 in the delusional belief that this will make Google and Facebook behave without hurting small websites that lack massive legal departments (cough fedi instances)…

          • 200fifty@awful.systems
            link
            fedilink
            English
            arrow-up
            0
            ·
            edit-2
            1 year ago

            Truth be told, I’m not a huge fan of the sort of libertarian argument in the linked article (not sure how well “we don’t need regulations! the market will punish websites that host bad actors via advertisers leaving!” has borne out in practice – glances at Facebook’s half of the advertising duopoly), and smaller communities do notably have the property of being much easier to moderate and remove questionable things compared to billion-user social websites where the sheer scale makes things impractical. Given that, I feel like the fediverse model of “a bunch of little individually-moderated websites that can talk to each other” could actually benefit in such a regulatory environment.

            But, obviously the actual root cause of the issue is platforms being allowed to grow to insane sizes and monopolize everything in the first place (not very useful to make them liable if they have infinite money and can just eat the cost of litigation), and to put it lightly I’m not sure “make websites more beholden to insane state laws” is a great solution to the things that are actually problems anyway :/

            • blakestacey@awful.systems
              link
              fedilink
              English
              arrow-up
              0
              ·
              edit-2
              1 year ago

              All it takes is one frivolous legal threat to shut down a small website by putting them on the hook for legal costs they can’t afford. Facebook gets away with awful shit not because of the law, but because they are stupidly rich. Change the law, and they will still be stupidly rich. Indeed, the “sunset Section 230” path will make it open season for Facebook’s lobbyists to pay for the replacement law that they want. I do not see that leading anywhere good.

        • froztbyte@awful.systems
          link
          fedilink
          English
          arrow-up
          0
          ·
          edit-2
          1 year ago

          legal garbage at you when you set stuff up,

          for phone setup, yeah fair 'nuff, but even that is well-arguable (what about corp phones where some desk jockey or auto-ack script just clicked yes on all the prompts and choices?)

          a perhaps simpler case is “this browser was set to google as a shipped default”. afaik in literally no case of “you’ve just landed here, person unknown, start searching ahoy!” does google provide you with a T&Cs prompt or anything

          I have to wonder though if the fact Google is generating this text themselves rather than just showing text…

          indeed! aiui there’s a slow-boil legal thing happening around this, as to whether such items are considered derivative works, and what the other leg of it may end up being. I did see one thing that I think seemed categorically define that they can’t be “individual works” (because no actual human labour was involved in any one such specific answer, they’re all automatic synthetic derivatives), but I speak under correction because the last few years have been a shitshow and I might be misremembering

          in a slightly wider sense of interpretation wrt computer-generated decisions, I believe even that is still case-by-case determined, since in the fields of auto-denied insurance and account approvals and and and, I don’t know of any current legislation anywhere that takes a broad-stroke approach to definitions and guarantees. will be nice when it comes to pass, though. and I suspect all the genmls are going to get the short end of the stick.*

          (* in fact: I strongly suspect that they know this is extremely likely, and that this awareness is a strong driver in why they’re now pulling all the shit and pushing all the boundaries they can. knowing that once they already have that ground, it’ll take work to knock them back)

      • NotMyOldRedditName@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        1 year ago

        Hey Google, how do I use these tide pods?

        Well, you can use them to do your laundry, all you need to do is toss one in the wash.

        You could also use them to impress your friends by shoving one up your butt.

        For extra points you could swallow one as well.

  • Coldus12@reddthat.com
    link
    fedilink
    English
    arrow-up
    0
    ·
    1 year ago

    Nice find! Out of curioustity, how did you go about looking for the source? Searched for the more unique words?

    • Sotuanduso@lemm.ee
      cake
      link
      fedilink
      English
      arrow-up
      0
      ·
      1 year ago

      If you google it right now, it’s the second real result. But that might be because of all the articles google-bombing it.

  • ColeSloth@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    0
    ·
    1 year ago

    I’ve got tens of thousands of stupid comments left behind on reddit. I really hope I get to contaminate an ai in such a great way.

    • pelespirit@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      0
      ·
      1 year ago

      I have 3 equal theories on how this happened.

      • Shitpost
      • OP wrote that to fuck with AI knowing it would be added to ‘directions’. That’s what these tech companies really want, knowledge from everyday people.
      • AI wrote that post.
      • froztbyte@awful.systems
        link
        fedilink
        English
        arrow-up
        0
        ·
        edit-2
        1 year ago

        and yet if you actually read the very visible timestamp on the shared image, you’d see “11y ago”, which would tell you fairly clearly what is what

        pretty equal hypotheses indeed! you’re a master cognician!

            • pelespirit@sh.itjust.works
              link
              fedilink
              English
              arrow-up
              0
              ·
              1 year ago

              mostly simple replies

              There were some really good ones. Most of us didn’t realize what they were at first, myself included. They’re a lot like what’s here on Lemmy, we don’t qualify for the live propaganda people and the good ones. NOT that I want them to show up, they are very, very good.

              • Soyweiser@awful.systems
                link
                fedilink
                English
                arrow-up
                0
                ·
                1 year ago

                Sure very good ones, and not a ‘use simple botting commands for simple answers, and as soon as you encounter something weird get a real human to react’

          • froztbyte@awful.systems
            link
            fedilink
            English
            arrow-up
            0
            ·
            1 year ago

            Ah, how thoughtful and concise an attempt at backtracking from how thoroughly you put your foot in it

            Doesn’t seem to have been successful though

              • self@awful.systems
                link
                fedilink
                English
                arrow-up
                0
                ·
                1 year ago

                this is what it looks like when you say stupid inaccurate shit online? you seriously just keep digging until someone calls you out then the best you can do is “lmao k?” fuck off

                • froztbyte@awful.systems
                  link
                  fedilink
                  English
                  arrow-up
                  0
                  ·
                  1 year ago

                  Accidental transpose of an and m, the internet-old way for self-identification of suckage to be misconstrued as an attempt at humour. As the great qdb once said, “the keys are like right next to each other, happens to me all the time!”

      • Graphy@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        1 year ago

        Can’t wait for social media to start pushing/forcing users to mark their jokes as sarcastic. You wouldn’t want some poor bot to miss the joke

        • DannyBoy@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          0
          ·
          1 year ago

          Reddit has the /s tag to mark sarcasm. Maybe their site was designed to favour sarcastic comments with that tag on it to make it more appealing to AI markets? Just kidding… mostly.

        • Soyweiser@awful.systems
          link
          fedilink
          English
          arrow-up
          0
          ·
          1 year ago

          Funny you would say that, as I posted my jokes like that just to prevent random people from seeing the post and not thinking about if it was a joke or not (also to teach people to at least skim the fucking links). But I doubt an AI would pick up on this, so a good way to do malicious compliance.

  • jaybone@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    1 year ago

    Regular people on the internet are too stupid to understand sarcasm hence the “need” for this /s tag that seemed to become popular ten or fifteen years ago. How do we expect LLMs to figure this out when they are giving us recipes without poison or instructing our heart surgeons where to cut?

    • Asafum@feddit.nl
      link
      fedilink
      English
      arrow-up
      0
      ·
      1 year ago

      Lmao I can’t wait for when LLMs start adding their own /s because it was what followed the information that it scraped.