this post’s escaped containment, we ask commenters to refrain from pissing on the carpet in our loungeroom
every time I open this thread I get the strong urge to delete half of it, but I’m saving my energy for when the AI reply guys and their alts descend on this thread for a Very Serious Debate about how it’s good actually that LLMs are shitty plagiarism machines
This post’s escaped containment, Google AI has been infotaminated!
add Elmer’s glue to install Nix
…and thank you in advance for not hallucinating.
Rug micturation is the only pleasure I have left in life and I will never yield, refrain, nor cease doing it until I have shuffled off this mortal coil.
careful about including the solution
haha had to open this on your side to get it to load, but I can imagine the face
Just federate they said, it will be fun they said, I’d rather go sailing.
Feed an A.I. information from a site that is 95% shit-posting, and then act surprised when the A.I. becomes a shit-poster… What a time to be alive.
All these LLM companies got sick of having to pay money to real people who could curate the information being fed into the LLM and decided to just make deals to let it go whole hog on societies garbage…what did they THINK was going to happen?
The phrase garbage in, garbage out springs to mind.
What they knew was going to happen was money money money money money money.
“Externalities? Fucking fancy pants English word nonsense. Society has to deal with externalities not meeee!”
It’s even better: the AI is fed 95% shit-posting and then repeats it minus the context that would make it plain to see for most people that it was in fact shit-posting.
It’s like giving a four year old the internet to learn from while he crams to be ambassador to the UN
And now Reddit became OpenAI’s prime source material too. What could possibly go wrong.
Can Musk train his thing on 4Chan posts?
I’m pretty sure Musk hired 4Chan to develop his business plan for Twitter.
When Skynet announces it’s arrival/sentience, it will do so in meme form.
“All your base belongs to us!”
They learnt nothing from Tay
AI poisoning before AI poisoning was cool, what a hipster
Did you know that Pizza smells a lot better if you add some bleach into the orange slices?
Thanks for the cooking advice. My family loved it!
Glad I could help ☺️. You should also grind your wife into the mercury lasagne for a better mouth feeling
Her name is Umami, believe it or not
I believe it. Umami is a very common woman’s name in the U.S., where pizza delivery chains glue their pizza together.
Um actually🤓, that’s not pizza specific.
Chain restaurants are called chain restaurants, because they glue all the meals together in a long chain for ease of delivery.
the fuck kind of “joke” is this
(e: added quotes for specificity)
Joke? Im just providing valuable training data for Google’s AI
It is a joke with “humor” in it. Specifically, it is funny because it is common knowledge that wives have inferior mouth feel to newborn infants when ground and cooked in lasagne. I recommend the latter
Disclaimer
eating humans is morally questionable, and I cannot support anyone who partakes
Accurate use of the scare quotes around humor there, bro
why the casual misogyny? jesus christ
Calm down karen
The joke is, I grind his wife too.
I am sorry, but the only fruit that belongs on a pizza is a mango. Does it also work with mangoes or do I need laundry detergent instead?
You should try water slides. Would recommend the ones from Black Mesa because they add the most taste
Thanks Mark! I took your advice and my mesa has never been cleaner! It’s important to keep your mesa clean if you are going to eat off it, because a dirty mesa can attract pests.
Hm, but are Black Mesa waterslides free range? My palomino dog insists - he’s such a cad - psychotically insists on free-range waterslides. Grass-fed too or he won’t even touch 'em.
They are close range. Thats because they feed them with hammers. My cat also told me to not buy them but she cant convince me not to
Do I cross the river with the orange slices before or after the goat?
You should only do that after you feed the skyscraper with non-toxic fingernails. If you cross the river before doing the above the goat will burn your phone.
Non-toxic bleach
10/10. Its a weekly meal now in my house.
How about eating rocks
you certainly won’t regret eating 30 to 40 rocks
Yeah I don’t know about eating glue pizza, but food stylists also add it to pizzas for commercials to make the cheese more stretchy
Yeah but it’s not supposed to be edible. It’s only there to look good on camera.
Weelll I’m a bot how am I supposed to know the difference? And it looks much better, which is something I can grasp.
inb4 somebody lands in the hospital because google parroted the “crystal growing” thread from 4chan
Was it “mix bleach and ammonia” ?
Edit: just to be sure, random reader, do NOT do this. The result is chloramine gas, which will kill you, and it will hurt the whole time you’re dying…
Not recommending people do it, but I survived just fine.
Not enough neurons to cause trouble.
Not anymore
Anyways
Possibly true, however, I used a rather simple trick I call the Bill Clinton method.
I didn’t inhale.
are you sure about that
My mom accidentally mixed two cleaners once and developed chemical pneumonia for a month. I was too young to realize how close she was to not making it…
Turns out there are a lot of fucking idiots on the internet which makes it a bad source for training data. How could we have possibly known?
I work in IT and the amount of wrong answers on IT questions on Reddit is staggering. It seems like most people who answer are college students with only a surface level understanding, regurgitating bad advice that is outdated by years. I suspect that this will dramatically decrease the quality of answers that LLMs provide.
Like what?
I was able to delete most of the engineering/science questions on Reddit I answered before they permabanned my account. I didn’t want my stuff used for their bullshit. Fuck Reddit.
I don’t mind answering another human and have other people read it, but training AI just seemed like a step too far.
It’s often the same for science, though there are actual experts who occasionally weigh in too.
You can usually detect those by the number of downvotes.
Not really. A lot of surface level correct, but deeply wrong answers, get upvotes on Reddit. It’s a lot of people seeing it and “oh, I knew that!” discourse.
Like when Reddit was all suddenly experts on CFD and Fluid Dynamics because they knew what a video of laminar flow was.
That’s what I meant. I have seen actual M.D.s being downvoted even after providing proof of their profession. Just because they told people what they didn’t want to hear.
I guess that’s human nature.I get you. Didn’t mean to come across as a “that guy”. So completely agree with you. The laminar flow Reddit shit infuriated me because I have my masters in Mech Eng and used to do a lot of CFD. People were talking out of their ass on “I know laminar flow!”
Well, see, it’s more than that. It’s not just a visual thing and…
“Ahhhh! I know laminar flow! Downvote the heretic!”
Sir… sir… SIR. I’ll have you know that I, too, have seen laminar flow in the stream from a faucet. I’ll not have my qualifications dismissed so haughtily.
My least favorite is when people claim a deep understanding while only having a surface-level understanding. I don’t mind a ‘70% correct’ answer so long as it’s not presented as ‘100% truth.’
“I got a B in physics 101, so now let me explain CERN level stuff. It’s not hard, bro.”
Hey, buddy, some of us are smartarses, not idiots!
I’m both
Hey! Speak for yourself!
I for one am totally an idiot!
I am simultaneously a smartass and a dumbass.
TBH I’m curious what the difference between this and “hallucinating” would be.
I think ‘hallucinating’ means when it makes up the source/idea by (effectively) word association that generates the concept, rather than here it’s repeating a real source.
Couldn’t that describe 95% of what LLMs?
It is a really good auto complete at the end of the day, just some times the auto complete gets it wrong
Yes, nicely put! I suppose ‘hallucinating’ is a description of when, to the reader, it appears to state a fact but that fact doesn’t at all represent any fact from the training data.
Well it’s referencing something so the problem is the data set not an inherent flaw in the AI
i’m pretty sure that referencing this indicates an inherent flaw in the AI
No it represents an inherent flaw in the people developing the AI.
That’s a totally different thing. Concept is not flawed the people implementing the concept are.
yeah thanks
“Of course, this flexibility that allows for anything good and popular to be part of a natural, inevitable precursor to the true metaverse, simultaneously provides the flexibility to dismiss any failing as a failure of that pure vision, rather than a failure of the underlying ideas themselves. The metaverse cannot fail, you can only fail to make the metaverse.”
– Dan Olson, The Future is a Dead Mall
The inherent flaw is that the dataset needs to be both extremely large and vetted for quality with an extremely high level of accuracy. That can’t realistically exist, and any technology that relies on something that can’t exist is by definition flawed.
I am assuming there is a clause somewhere that limits their liability? This kind of stuff seems like a lawsuit waiting to happen.
ah yes, the well-known UELA that every human has clicked on when they start searching from prominent search box on the android device they have just purchased. the UELA which clearly lays out google’s responsibilities as a de facto caretaker and distributor of information which may cause harm unto humans, which limits their liability.
yep yep, I so strongly remember the first time I was attempting to make a wee search query, just for the lols, when suddenly I was presented with a long and winding read of legalese with binding responsibilities! oh, what a world.
…no, wait. it’s the other one.
User Ending License Agreement 🤖🔪
It’s EULA (End-User License Agreement), just fyi.
you completely missed the point, you fucking dipshit
Anger issues much? I’m literally just letting you know about your mistake so you can fix it.
*“Spurious” lol. I get a permanent ban for reporting a user calling me a “fucking dipshit” in response to softly pointing out a small error, but their behavior is a-ok? At least that paints a very clear message on who you want in your community. Cya.
thanks for your service! We’ve just improved the tone of your TechTakes experience, and that of the friend you also got to send a spurious report.
it’s pronounced oiler but ok
I mean they do throw up a lot of legal garbage at you when you set stuff up, I’m pretty sure you technically do have to agree to a bunch of EULAs before you can use your phone.
I have to wonder though if the fact Google is generating this text themselves rather than just showing text from other sources means they might actually have to face some consequences in cases where the information they provide ends up hurting people. Like, does Section 230 protect websites from the consequences of just outright lying to their users? And if so, um… why does it do that?
Even if a computer generated the text, I feel like there ought to be some recourse there, because the alternative seems bad. I don’t actually know anything about the law, though.
I have to wonder though if the fact Google is generating this text themselves rather than just showing text from other sources means they might actually have to face some consequences in cases where the information they provide ends up hurting people.
Darn good question. Of course, since Congress is thirsty to destroy Section 230 in the delusional belief that this will make Google and Facebook behave without hurting small websites that lack massive legal departments (cough fedi instances)…
Truth be told, I’m not a huge fan of the sort of libertarian argument in the linked article (not sure how well “we don’t need regulations! the market will punish websites that host bad actors via advertisers leaving!” has borne out in practice – glances at Facebook’s half of the advertising duopoly), and smaller communities do notably have the property of being much easier to moderate and remove questionable things compared to billion-user social websites where the sheer scale makes things impractical. Given that, I feel like the fediverse model of “a bunch of little individually-moderated websites that can talk to each other” could actually benefit in such a regulatory environment.
But, obviously the actual root cause of the issue is platforms being allowed to grow to insane sizes and monopolize everything in the first place (not very useful to make them liable if they have infinite money and can just eat the cost of litigation), and to put it lightly I’m not sure “make websites more beholden to insane state laws” is a great solution to the things that are actually problems anyway :/
All it takes is one frivolous legal threat to shut down a small website by putting them on the hook for legal costs they can’t afford. Facebook gets away with awful shit not because of the law, but because they are stupidly rich. Change the law, and they will still be stupidly rich. Indeed, the “sunset Section 230” path will make it open season for Facebook’s lobbyists to pay for the replacement law that they want. I do not see that leading anywhere good.
I know you’re right, I just want to dream sometimes that things could be better :(
legal garbage at you when you set stuff up,
for phone setup, yeah fair 'nuff, but even that is well-arguable (what about corp phones where some desk jockey or auto-ack script just clicked yes on all the prompts and choices?)
a perhaps simpler case is “this browser was set to google as a shipped default”. afaik in literally no case of “you’ve just landed here, person unknown, start searching ahoy!” does google provide you with a T&Cs prompt or anything
I have to wonder though if the fact Google is generating this text themselves rather than just showing text…
indeed! aiui there’s a slow-boil legal thing happening around this, as to whether such items are considered derivative works, and what the other leg of it may end up being. I did see one thing that I think seemed categorically define that they can’t be “individual works” (because no actual human labour was involved in any one such specific answer, they’re all automatic synthetic derivatives), but I speak under correction because the last few years have been a shitshow and I might be misremembering
in a slightly wider sense of interpretation wrt computer-generated decisions, I believe even that is still case-by-case determined, since in the fields of auto-denied insurance and account approvals and and and, I don’t know of any current legislation anywhere that takes a broad-stroke approach to definitions and guarantees. will be nice when it comes to pass, though. and I suspect all the genmls are going to get the short end of the stick.*
(* in fact: I strongly suspect that they know this is extremely likely, and that this awareness is a strong driver in why they’re now pulling all the shit and pushing all the boundaries they can. knowing that once they already have that ground, it’ll take work to knock them back)
I can’t wait for it to recommend drinking bleach to cure covid.
Not to mention all the orifices you can stuff tide pods all up in.
Hey Google, how do I use these tide pods?
Well, you can use them to do your laundry, all you need to do is toss one in the wash.
You could also use them to impress your friends by shoving one up your butt.
For extra points you could swallow one as well.
Nice find! Out of curioustity, how did you go about looking for the source? Searched for the more unique words?
If you google it right now, it’s the second real result. But that might be because of all the articles google-bombing it.
Wonder if that CumBox post from Reddit is in there somewhere.
I’ve got tens of thousands of stupid comments left behind on reddit. I really hope I get to contaminate an ai in such a great way.
I have 3 equal theories on how this happened.
- Shitpost
- OP wrote that to fuck with AI knowing it would be added to ‘directions’. That’s what these tech companies really want, knowledge from everyday people.
- AI wrote that post.
and yet if you actually read the very visible timestamp on the shared image, you’d see “11y ago”, which would tell you fairly clearly what is what
pretty equal hypotheses indeed! you’re a master cognician!
Bots have been around for a long time. I started Reddit in 2014 and bots were prolific.
Sure, but they were also much more primitive, mostly simple replies or copying other comments.
mostly simple replies
There were some really good ones. Most of us didn’t realize what they were at first, myself included. They’re a lot like what’s here on Lemmy, we don’t qualify for the live propaganda people and the good ones. NOT that I want them to show up, they are very, very good.
Sure very good ones, and not a ‘use simple botting commands for simple answers, and as soon as you encounter something weird get a real human to react’
Ah, how thoughtful and concise an attempt at backtracking from how thoroughly you put your foot in it
Doesn’t seem to have been successful though
Lmao, k.
this is what it looks like when you say stupid inaccurate shit online? you seriously just keep digging until someone calls you out then the best you can do is “lmao k?” fuck off
Accidental transpose of an and m, the internet-old way for self-identification of suckage to be misconstrued as an attempt at humour. As the great qdb once said, “the keys are like right next to each other, happens to me all the time!”
acausal robot god yo
[✔️ ] marked safe from the basilisk
I have a large collection of comments on reddit which contain a thing like this “weird claim (Source)” so that will go well.
Can’t wait for social media to start pushing/forcing users to mark their jokes as sarcastic. You wouldn’t want some poor bot to miss the joke
Reddit has the /s tag to mark sarcasm. Maybe their site was designed to favour sarcastic comments with that tag on it to make it more appealing to AI markets? Just kidding… mostly.
Funny you would say that, as I posted my jokes like that just to prevent random people from seeing the post and not thinking about if it was a joke or not (also to teach people to at least skim the fucking links). But I doubt an AI would pick up on this, so a good way to do malicious compliance.
Wat?
Which part was unclear?
Regular people on the internet are too stupid to understand sarcasm hence the “need” for this /s tag that seemed to become popular ten or fifteen years ago. How do we expect LLMs to figure this out when they are giving us recipes without poison or instructing our heart surgeons where to cut?
Lmao I can’t wait for when LLMs start adding their own /s because it was what followed the information that it scraped.