wizardbeard@lemmy.dbzer0.com to

TechTakes@awful.systemsEnglish · 27 days ago

ChatGPT o3 found a Linux Kernel vulnerability. "The future" has an 8% success rate, and a 28% chance of false positives.

77

ChatGPT o3 found a Linux Kernel vulnerability. "The future" has an 8% success rate, and a 28% chance of false positives.

wizardbeard@lemmy.dbzer0.com to

TechTakes@awful.systemsEnglish · 27 days ago

How I used o3 to find CVE-2025-37899, a remote zeroday vulnerability in the Linux kernel’s SMB implementation

In this post I’ll show you how I found a zeroday vulnerability in the Linux kernel using OpenAI’s o3 model. I found the vulnerability with nothing more complicated than the o3 API ̵…

This blog post has been reported on and distorted by a lot of tech news sites using it to wax delusional about AI’s future role in vulnerability detection.

But they all gloss over the critical bit: in fairly ideal circumstances where the AI was being directed to the vuln, it had only an 8% success rate, and a whopping 28% false positive rate!

Chat

Sailor Sega Saturn@awful.systems
link
fedilink
English
arrow-up
24·
edit-2
27 days ago
LLMs: now as effective as enumerating use-after-frees as grep "free" source.cc.

TechTakes@awful.systems

techtakes@awful.systems

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

Big brain tech dude got yet another clueless take over at HackerNews etc? Here’s the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

47 users / day
184 users / week
1.89K users / month
5.62K users / 6 months
1 local subscriber
2K subscribers
777 Posts
21.2K Comments
Modlog

mods:
David Gerard@awful.systems