It's so over

The Picard Maneuver@lemmy.world · 4 months ago

It's so over

MindTraveller@lemmy.ca · 4 months ago

Some disabled people have trouble with captchas, so these days you can download an extension where a robot solves the captcha for you.

Carl@sh.itjust.works · 4 months ago

I have trouble with the selecting the squares on the grid, for images that say select only crosswalks, bikes, etc. I hate that captcha. Is there a decent extension for firefox?

⸻ Ban DHMO 🇦🇺 ⸻@aussie.zone · 4 months ago

Buster can do reCAPTCHA: https://addons.mozilla.org/en-US/firefox/addon/buster-captcha-solver/

Marleyinoc@lemmy.world · 4 months ago

It got it wrong. That’s a lowercase “p.”

NegativeNull@lemmy.world · 4 months ago

DragonTypeWyvern@midwest.social · 4 months ago

Obviously not. He won a legal case for emancipation.

samus12345@lemmy.world · 4 months ago

Good thing the Federation stuck to that!

samus12345@lemmy.world · edit-2 4 months ago

“Actually there is a distinct difference between an android and a robot.”

Yup, Data did an “um, ackshully…”

considine@lemmy.ml · 4 months ago

Dead internet shall become undead.

xia@lemmy.sdf.org · 4 months ago

I seem to recall OpenAI saying they guarded against captcha solving…

jol@discuss.tchncs.de · 4 months ago

Captchas te not meant to deter all bots. It’s meant to make it ever so slightly expensive that a mass DDOS attack would be extremely expensive to perform. Think like thousand sof requests per second, all being Captcha’d and how much it costs to run AI. It’s current not a feasible solution.

There is cheaper AI that can solve Captchas though, and it’s only gonna get cheaper.

frezik@midwest.social · 4 months ago

It’s long been cheap enough that you can pay a call center full of people in a developing country to solve them for you. Going to be a while before AI is cheaper than that.

Having used them to protect a few web sites from spammers filling up forms, they do cut down on the bullshit. This makes things more convenient for the people reading the information coming in from those forms, but I sometimes wonder if it’s worth the cost of everyone else having to pick out the bicycles in the picture.

AwkwardLookMonkeyPuppet@lemmy.world · 4 months ago

They can’t solve them a million times per minute though.

Ballistic_86@lemmy.world · edit-2 4 months ago

I believe this is why Google, and a few other companies, have started using behavioral analysis to figure out if you are human. Did your mouse wonder around the page before clicking to verify? Did you come from another website as if browsing the web? What device are you using and have you used it on this site before? Are you logged into an account? I’m sure they use many more factors, but it’s something that would be hard to replicate with bot behavior on a consistent basis (for now).

brbposting@sh.itjust.works · 4 months ago

Apple and Cloudflare are using “Private Access Tokens”

Some negative implications for the open web I believe

Cethin@lemmy.zip · edit-2 4 months ago

Also, captchas are meant to gather data to train on. That’s why we used to have pictures of writing, but that’s basically solved now. It’s why we now have a lot of self driving vehicle focused ones now, like identifying busses, bikes, traffic lights/signs, and that sort of thing.

Captchas get humans to label data so the ML algorithms can train on it, eventually being able to identify the tests themselves.

AwkwardLookMonkeyPuppet@lemmy.world · 4 months ago

Now it’s making me identify developed pictures from a photo negative. I’m not quite sure what they’re going to do with that training since computers can already perform that task.

bitwolf@lemmy.one · 4 months ago

A common OCR tactic is to turn the image negative and bump the contrast to make text easier to recognize.

It could be a precursor for that step.

TheOakTree@lemm.ee · 4 months ago

Also the “select the image below containing the example image above.”

Like… we already have computers that can recognize image repetitions.

Cethin@lemmy.zip · 4 months ago

So that’s almost certainly trying to gather data to defeat data poisoning. The other image is probably slightly altered in a way you can’t detect.

Voroxpete@sh.itjust.works · 4 months ago

There’s a lot of misunderstanding in this thread about how captchas work.

What modern captchas examine isn’t actually your ability to solve the puzzle… It’s how you solve it. Things like mouse movements and how you type are big factors. So a bot would process for a moment, and then basically copy and paste in the answer, whereas as a human is going to type at a normal pace, often with pauses as they double check the details. Same goes for the click the tiles challenges. A bot will work through systematically, a human will bounce around, and their timings will be very different.

Lets_Eat_Grandma@lemm.ee · 4 months ago

Captchas have largely been solvable by machines at a rate higher than humans for a long, long time.

It is very easy to train a model to behave like humans do by simply having a sample of human inputs.

Here is an article from august 2023 covering how much better machines are than humans at accomplishing captchas of many flavors. Sauce

BluesF@lemmy.world · 4 months ago

The “puzzle” isn’t the test, the test uses your browser history, mouse activity, etc to identify you as human (or not). The puzzle is used to generate training data for ML models.

normalexit@lemmy.world · 4 months ago

Lol, well maybe not your browser history. That would be bad.

Daxtron2@startrek.website · 4 months ago

Sure with a modern captcha framework that would be true. In this case, this looks like something that was custom rolled for their site so its pretty unlikely.

BluesF@lemmy.world · 4 months ago

Oh, true, I didn’t look too close.

MystikIncarnate@lemmy.ca · 4 months ago

I’m just saying, but captcha had a purpose. It still kind of does. Whether solved by a person or by an AI.

I’m pretty sure that for a good while there it was using captcha to help its text recognition more accurately determine what words were from scans of books that were imported en masse to Google books as images of pages. We’re talking about books published before computers were used to write them. The text recognition algorithm had an idea of what the letters should be, but didn’t have a high enough confidence in the result, so it was sent through captcha to get a consensus from humans.

The humans answering the captcha would just verify whether it was one of a small list of possible matches, and in doing so, train the machine vision algorithm to better detect the letter in the future.

That’s what I heard at least. IDK. I just live here (on the internet).

BakedCatboy@lemmy.ml · 4 months ago

I’m curious if it could solve the traffic light and crosswalk ones, I would try but I’m out of free image uploads from asking it to explain memes to test its cultural knowledge.

The Picard Maneuver@lemmy.world · 4 months ago

Wow, that’s actually quite impressive.

jballs@sh.itjust.works · 4 months ago

I’m sure eventually someone will make a bot called something like ai-explains-the-joke that does this automatically.

bobaFeet@lemmy.world · 4 months ago

Expl-AI-n Bot will break down whatever joke you feed it.

TriPolarBearz@lemmy.world · 4 months ago

Expl-AI-n itself is a pun. With the letters AI in the word explain capitalized, readers can infer that artificial intelligence is being used to explain jokes.

kromem@lemmy.world · 4 months ago

The majority of people right now are fairly out of touch with the actual capabilities of modern models.

There’s a combination of the tech learning curve on the human side as well as an amplification of stories about the 0.5% most extreme failure conditions by a press core desperate to feature how shitty the technology they are terrified of taking their jobs is.

There’s some wild stuff most people just haven’t seen.

Miaou@jlai.lu · 4 months ago

I can just as well say that the screenshot above is the top 0.5% pushed by people trying to sell the tech. I don’t really have an opinion either way tbh, I’m just being cynical. But my own experience with those tools hasn’t been impressive.

kromem@lemmy.world · 4 months ago

At a pretrained layer, the model is literally a combination of a normal distribution curve of capabilities.

It can autocomplete a flat earther as much as a Nobel physicist given sufficient context.

So it makes sense that even after the fine tuning efforts there’d be a distribution in people’s experiences with the tools.

But just as the average person’s output from Photoshop isn’t going to be very impressive, if all you ever really see is bad Photoshops and average use, you might think it’s a crappy tool.

There’s a learning curve to the model usage, and even in just a year of research the difference between capabilities of the exact same model from then to now is drastically different, based only on learnings around better usage.

The problem is the base models are improving so quickly the best practices for the old generation of models goes out the window with the new. So even if there were classes available I wouldn’t bother pointing you to them as you’d just be picking up info obsolete by the time the classes finished or shortly thereafter.

I’d just strongly caution against betting against the tech’s continued capabilities and improvements if you don’t want to be surprised and haven’t taken the time to look into them operating at their best.

The OP post is pretty crap compared to the top 0.5% usage.

The Picard Maneuver@lemmy.world · 4 months ago

At the risk of sounding like a tech bro who’s desperately trying to secure funding: this truly does feel like a major leap in technology that is going to change the world.

Anytime I hear it dismissed as “basically auto-complete”, I feel like it’s being underestimated.

AdrianTheFrog@lemmy.world · 4 months ago

Its kind of funny because autocomplete on phones is definitely moving in the direction of using LLMs. Its like it wasn’t true when people started saying it, but it will be literally true in a couple of years at most.

kromem@lemmy.world · edit-2 4 months ago

It’s not just underestimation, it’s outright misinformation.

There’s so much research by this point over the past 18 months that there’s an incredible amount going on beyond “it’s just a Markov chain, bro.”

It was never a Markov chain as that ignored the self-attention mechanism which violated the Markov property. It was just some people trying to explain it used a simplified description which went viral.

Sometimes talking to people who think it’s crap feels like talking to antivaxxers. The feelings matter more than the research and evidence.

SpaceNoodle@lemmy.world · 4 months ago

I wonder how much was scraped from knowyourmeme.com

WldFyre@lemm.ee · 4 months ago

I mean it still parsed the specific text in the meme and formulated a coherent explanation of this specific meme, not just the meme format

SpaceNoodle@lemmy.world · 4 months ago

Or it matched the text with an existing explanation upon which it was indexed.

Jakeroxs@sh.itjust.works · 4 months ago

Lmao you think it found a specific explanation for this specific variation of this meme?

SpaceNoodle@lemmy.world · 4 months ago

For each phrase, yes.

Hexarei@programming.dev · 4 months ago

That’s not how GPTs work

SpaceNoodle@lemmy.world · 4 months ago

That’s literally how they work

Hexarei@programming.dev · 4 months ago

They do not store anything verbatim; They instead store the directions in which various words and related concepts relate to one another in some gigantic multidimensional space.

I highly suggest you go learn what they actually do before you continue talking out of your ass about them

GreatDong3000@lemm.ee · 4 months ago

Man the models can’t store verbatim its training data, the amount of data is turned into a model that is hundreds or thousands of times smaller than the original source data. If it was capable of simply recovering everything that it was trained on this would be some magical compression algorithm and that by itself would be extremely impressive.

stormy@lemmy.world · 4 months ago

Did you find that meme online, or did you create it yourself?

BakedCatboy@lemmy.ml · 4 months ago

I don’t remember actually but I checked the file metadata and I have the template in my downloads folder next to this which has an exif tag of 2 minutes later with gimp metadata so I’m pretty sure I must have made it, which makes it a bit more impressive since I probably just sent it to friends privately and didn’t post it anywhere it could have been scraped for training.

HootinNHollerin@lemmy.world · 4 months ago

a fellow SolidWorks victim

DarkCloud@lemmy.world · 4 months ago

Yes it probably can… CAPTCHAs don’t work based on your answers (many types you can answer wrong and still sometimes pass) - they work by tracking your mouses movements and timing and deciding whether they human-like.

Psychodelic@lemmy.world · 4 months ago

I figured it’s due to using a vpn or ad blocker or something

Match!!@pawb.social · 4 months ago

Why do i fail the “choose all images with motorcycles” challenges all the time then :c

TexasDrunk@lemmy.world · 4 months ago

Are you the same guy who didn’t see me riding my motorcycle and tried to run over me? Because I think maybe you just can’t see motorcycles.

No, that didn’t actually happen. I just wanted to give this person a hard time.

BrotherL0v3@lemmy.world · 4 months ago

Because half of the pictures are mopeds / scooters and God only knows whether those count or not?

ElderWendigo@sh.itjust.works · 4 months ago

I’m stubborn. I refuse to give the machine the answer I know it wants. And no, that overpass is not a bridge. Usually there is an option to skip or verify another way, This is when the captcha drops the ruse and it’s clear that the machine was just analyzing my mouse movements and response timings anyway to verify that I was behaving randomly in a human way. Still a better game than any of those in YouTube ads.

postmateDumbass@lemmy.world · 4 months ago

Do handlebars count?

CileTheSane@lemmy.ca · 4 months ago

Are you human?

Match!!@pawb.social · 4 months ago

laughs nervously

AceCephalon@pawb.social · 4 months ago

Laughs electronically

frezik@midwest.social · 4 months ago

I dunno. Why aren’t you helping the tortoise?

VieuxQueb@lemmy.ca · 4 months ago

Wow, it did a great job at it !

SomeGuy69@lemmy.world · 4 months ago

Captchas are used by google as weapon now, if you dare to use a VPN and adblocker.

lud@lemm.ee · 4 months ago

?

Lots of websites force capchas when on a VPN they don’t even have to be provided by Google. Rarbg for example forced a terrible captcha which I usually solved by using OCR with the OCR tool in powertoys. They letters were barely edited or fucked up at all.

SomeGuy69@lemmy.world · 4 months ago

Google now keeps you often in endless loop until you disable your VPN. This was never this bad.

johannesvanderwhales@lemmy.world · 4 months ago

They appear to have degrees of blacklist. Usually when this happens if I get a new ip it resolved the issue.

Note that VPN users share IPs with other users and many of the people using the same IP may very well actually be doing malicious things. Not everyone uses VPNs for just “privacy”.

lud@lemm.ee · 4 months ago

Ah alright. I never use Google search.

no_name_dev_from_hell@programming.dev · 4 months ago

It’s extremely bad if you come from a country like mine, Iran, where we have to use VPNs religiously in order to circumvent censorship and it has become painful to Google anything especially when you’re not logged into your Google account.

Quack@lemm.ee · 4 months ago

If you use the audio captcha it’s done it just one go. That’s been my experience at least after having been stuck in one too many endless loops with pictures.

johannesvanderwhales@lemmy.world · 4 months ago

Yeah I’ve completely switched to audio.

C126@sh.itjust.works · 4 months ago

I just switched to duckduckgo, that seems to have fixed it

Zeppo@sh.itjust.works · 4 months ago

This is why a lot of sites have moved to something more complex than text, like the weird “rotate this to match” stuff that LinkedIn uses.

nucleative@lemmy.world · 4 months ago

There’s a program called Xevil that can solve even HCaptcha reliably, and it can solve these first gen captions by the thousands per second. It’s been solving Google’s v3 recaptchas for a long time already too.

People who write automation tools (unfortunately, usually seo spammers and web scrapers) have been using these apps for a long time.

Captchas haven’t been effective at protecting important websites for years, they just keep the script kiddies away who can’t afford the tools.

edgesmash@lemmy.world · edit-2 4 months ago

Captchas haven’t been effective at protecting important websites for years, they just keep the script kiddies away who can’t afford the tools.

To be fair, keeping the script kiddies away has some good value. Whether that value outweighs all the wasted time and impact to sight/hearing impaired people is another discussion.

ILikeBoobies@lemmy.ca · 4 months ago

Decades not just years

assassinatedbyCIA@lemmy.world · 4 months ago

Computers have long since been able to defeat such captchas.

problematicPanther@lemmy.world · 4 months ago

it’s the recaptchas that they should have trouble with. since it’s not just about finding the right picture, it’s also about the time between clicks, the way the mouse moves, etc.

snooggums@midwest.social · 4 months ago

They probably have some kind of randomizer for that nowadays.

problematicPanther@lemmy.world · 4 months ago

but us humans aren’t truly random, we probably behave in similar ways to each other, but also have individual ‘fingerprints’. like the time it takes between keystrokes, or the length of time we spend holding the button on the mouse down while clicking. we could probably come up with a way of identifying someone based only on that kind of data. what was i talking about?

snooggums@midwest.social · 4 months ago

Not without enough context to know what time of day, if the person is ill, or a ton of other things that would make someone respond differently at different points in time.

The anti bot stuff is going to be looking for too much consistency, which is hard look for on its own before trying to look for some kind of ‘fingerprint’

pearsaltchocolatebar@discuss.online · 4 months ago

It’s pretty simple to make. I built one for my last job to make it look like I was working.

MonkderDritte@feddit.de · edit-2 4 months ago

reCaptcha never works for me. Probably something with thirdpartyisolation.enabled. Can’t snoop all the history and stuff.

lad@programming.dev · 4 months ago

Yeah, and a couple of people I know who were consistently reported to be robots because they’ve been shown captcha too much and as a result solved it too well. Which in turn led to more captcha and improved solving speed. Well, you see the problem, I guess

ZwoofBlaf@sh.itjust.works · 4 months ago

Yeah captchas are done. Soon they will be easier to figure out for AI than for humans.

This is why Sam Altman is doing his worldcoin thingy with the iris scanners. His idea: One iris (well, two…) is one real human. I’m sure this will be abused though and I absolutely vehemently don’t trust him with my biometrics so no way I will join that.

I think what we should do is just get used to the fact that the internet now consists of humans and AIs. Learn to take things with a grain of salt.

explodicle@sh.itjust.works · 4 months ago

I’d honestly rather pay microtransactions for various websites than use biometrics ever.

Hadriscus@lemm.ee · 4 months ago

I certainly don’t wanna get Demolition Man’d

bobotron@sh.itjust.works · 4 months ago

Techno core!