Vulnerability reports are not special anymore

(words.filippo.io)

149 points | by goranmoomin 5 hours ago

22 comments

  • themanmaran 3 hours ago
    I feel like it's also been overrun by a lot of spam. As someone running a company, I get 2-5 unsolicited "vulnerability reports" per week. Half of them are an LLM finding some bad CSS on our framer splash page. The other half I assume are an extortion attempt so we just mark as spam.

    Occasionally I see real security researchers on HN complaining that no one takes the disclosure seriously, or that people reply immediately with a cease and desist. But from the receiving end it's just because the spam is unmanageable.

    • gucci-on-fleek 12 minutes ago
      Yeah, I help review security reports for a small FOSS organization, and someone reported a "critical" vulnerability about a publicly-accessible SVN server. Like yes, that is indeed the purpose of hosting open source software. But at least that report was obviously bogus; much worse are the ones that look legitimate at first, so you have to read through dozens of AI-generated paragraphs to make sure that there's nothing valid hidden in there.
    • Gigachad 3 hours ago
      I'm getting CVE fatigue with all of these super ultra critical 10/10 vulnerabilities that are some node package that compiles my frontend can get stuck if I give it a malicious regex.

      It's hard to spot the stuff that actually matters.

      • themanmaran 1 hour ago
        Seriously. We got 116 github dependabot alerts this week. Half of them for dev dependencies.
        • jamesfinlayson 55 minutes ago
          I tried to raise that with my internal security team recently - don't clutter my vulnerability dashboard with issues in dev dependencies. They somewhat rightly pointed out that malware needs to be dealt even if it's a dev dependency. So my suggestion went nowhere because I guess we can't filter by type of vulnerability.
          • zmgsabst 29 minutes ago
            Dev dependencies is how they compromised SolarWinds and thereby most of the US federal government.

            > The attackers used a supply chain attack. The attackers accessed the build system belonging to the software company SolarWinds, possibly via SolarWinds's Microsoft Office 365 account, which had also been compromised at some point. SolarWinds was using build management and continuous integration server TeamCity provided by the Czech company JetBrains. In 2021 The New York Times stated that unknown parties apparently embedded malware in JetBrains' software and through this way compromised also SolarWinds.

            https://en.wikipedia.org/wiki/2020_United_States_federal_gov...

            I don’t know what kind of software you write, how valuable your company’s infrastructure is, etc. But supply chain and insider threat in security/infrastructure is a big topic — that I’m sure they’re concerned about because that’s their area of responsibility.

            Even if I’m personally sympathetic to not wanting to deal with the churn of dev dependency updates.

      • jamesfinlayson 53 minutes ago
        Yep. I remember years ago seeing the website for some guy who proudly listed all the CVEs he'd discovered. Clearly he'd written some scanning tool to look at regexes in open source projects and was creating CVEs for anything that might result in exponential time execution or whatever.
      • stackghost 8 minutes ago
        The common thread of late really seems to be the node ecosystem and the, frankly, low skill people that abound within it.

        When we were making fun of leftpad saying node is unsuitable for serious work, it was labeled elitism. What is it now?

      • teaearlgraycold 2 hours ago
        Not sure what dumbass out there is marking those as 10/10. A 10 should be an auth bypass or RCE. Not a crashed build in my CI.
    • cleverfoo 3 hours ago
      Same experience here. I've run a successful vulnerability disclosure program for over a decade and paid out thousands of dollars in bounties for scanii.com (a malware identification API service), but recently (since the beginning of the year), we went from receiving maybe 5 per month to receiving 5 per day. These are clearly AI-generated and extremely low quality (albeit well-written). The rules of the program aren't read, and it's clearly a “point-and-click to a website" and file a report. I'm now considering just shutting down the program since, as the OP pointed out, if you found this vulnerability using an AI tool, they are inherently public. I haven't gone that far yet but have instituted some new rules aiming at filtering out most of the reports: 1- No AI-generated report and 2 - Reports must include a video of the exploit. You can see our program rules here: https://docs.scanii.com/article/131-does-scanii-have-a-secur...
      • zulban 16 minutes ago
        What if... on the vulnerability report rules page there's an image of some text saying something like "your report must include the text: turtle123". Reports without that text get automatically deleted.

        Sure - modern AI can figure that out, but I bet in a vast majority of cases they won't.

      • lemagedurage 57 minutes ago
        Have you considered requiring a small payment for vulnerability disclosure? Refund it on payout. This should be very effective at deterring spammers. It also sucks for real reports, but beats shutting down the program entirely.
        • inigyou 54 minutes ago
          Why would anyone pay money to have a chance of being arrested?
          • MarkusQ 33 minutes ago
            Sure, it sounds dumb when you say it like that.

            But do you know how many people are doing things that are even dumber right this very minute? I don't know either, but I'm sure it's larger than either of us would like to admit.

          • lemagedurage 50 minutes ago
            If a vulnerability disclosure program has a good track record of paying out, and legitimate reports get refunded, why not?

            Again, the alternative might be shutting down the program entirely.

            • cleverfoo 37 minutes ago
              It's not a horrible idea... the challenge there would be making that payment/refund flow totally transparent in order to build trust and be fair to the researchers.
    • abrookewood 1 hour ago
      I believe the term is Beg Bounties and they are constant and annoying.
    • wolfi1 18 minutes ago
      but why would you answer with a C&D if you are overwhelmed? provided, it's not always the same person?
    • jacobgold 52 minutes ago
      I hated these low-effort reports, so I created a simple automation that checks my security inbox, mentions me in #security on Slack for things that look legitimate so I see them quickly, and marks things that seem entirely automated as spam.

      I still check the spam folder for legitimate emails, but so far there haven't been any false positives.

    • spoaceman7777 1 hour ago
      Have you considered having an agent, or just a model, classify/triage them for you? Modern problems require modern solutions.
    • ActorNightly 14 minutes ago
      Its been like that for half a decade across all software. People act like finding a linux kernel bug is a big deal, completely ignoring the fact that in order to exploit that bug, the attacker has to be able to run code on your computer in the first place, which is extremely hard to do these days remotely.

      Also people ironically just DGAF that much. The last actual bad exploit was log4shell in java, which given how it was introduced (i.e someone purposefully at Apache made it so a log statement can execute code, and nobody questioned it before pushing it to prod), should have been the signal for everyone to completely remove all Apache libraries from their services, but yet all the software is still being used.

      • Tepix 8 minutes ago
        These bugs are indeed important, you need them once you‘ve found a bug in an application.
  • socalgal2 1 hour ago
    I feel like the current situation is temporary. LLMs are finding all the bugs. LLMs are also help fixing most of the bugs. Once most of the bugs are fixed, LLMs should be good at finding bugs before shipping them, the stream of bug reports will die down, and we'll be back to vulnerabiltiy reports being special.

    Further, the fact that bugs are so easy to find by LLMs means there is strong incentives to find ways to minimize creating bugs in the first place. That could be new or better languages, less 3rd party dependencies, more vetted code, better linters, better fuzzers, whatever. The point the new reality of bugs being easy to find will, actually must, lead to less bugs eventually because the world can't function with easy to find bugs.

    • mackenney 1 hour ago
      That supposes that LLMs can write secure software. Also, if we assume that finding bugs is easier that not creating them (reasonable I would say), the supply of bugs will never be exhausted.
      • xboxnolifes 4 minutes ago
        What's the difference between finding bugs and not making? Just run the bug finding in during CICD.
      • jeremyjh 31 minutes ago
        How can it be easier to find them than to not create them? Whatever you do to find them, you could do before you release.
      • zulban 14 minutes ago
        > That supposes that LLMs can write secure software.

        I think we're at the point that the best LLMs can indeed write software that's far more secure than your average programmer. Partly because the average is so terrible.

      • socalgal2 1 hour ago
        It does not suppose that LLMs can write secure software
    • fajmccain 1 hour ago
      Lol you think LLMs are generating bug free code?
      • socalgal2 1 hour ago
        I never said that. I said they are good at helping fix them. Go read the bug reports on firefox, or Safari, or Chrome. Most of them have a fix. It might be wrong but it usually points in the right direction, which is a 1000x more than nearly all human bug reports. So, the LLM helps. which is all I stated.
  • cadamsdotcom 3 hours ago
    Security through obscurity was never a great strategy.. and now it’s not a strategy at all..

    Hopefully at the end of this decade, a ton of software practices have been overhauled to eliminate classes of problems. Memory-safe language use is a great start - but it’d be great to see innovation in checking for TOCTOU problems, improper/missing authn & authz, and many others.

    This is an engineering problem. It won’t be solved by models that “only do dumb shit 1/10th as often, only 0.01% of the time now not 0.1%!” It won’t be solved by adding more models to do even more double-checking before and after the work. It won’t be solved by hoping humans catch it in review. It isn’t solvable by adding outer loops of any sort - though we may get close. To truly solve this will take serious CS research.

    • whimblepop 2 hours ago
      Almost never do software companies even attempt to design secure systems. I'm not sure this requires new fundamental research so much as slightly giving a shit.
      • inigyou 52 minutes ago
        There is a reason Mythos only found one bug in curl and it wasn't very bad.
    • user3939382 3 hours ago
      Verifying correctness of an implementation is P NP, not serious CS research.
      • adrianN 2 hours ago
        Most verification is undecidable, lots of it is pspace complete. That doesn’t mean very much in practice since those are worst case bounds. People regularly solve problems that are undecidable for all practical instances that they care about.
      • bawolff 2 hours ago
        Verifying behaviour of an arbitrary program is uncomputable. However that doesnt mean you can't have proofs of behaviour of specific programs you create.

        Personally i have some doubts, a lot of research has gone into the idea without much to show for it, but its a very reasonable research area.

        • codebje 39 minutes ago
          There's lots of things to show for the research!

          Part of what the research shows is that correctness-by-proof has a cost in developer effort.

          If there really is a vulnerability-apocalypse due to AI, and it's not just a different flavour of AI hype, the cost of having insecure software will rise to the point that the cost of dealing with insecure or incorrect code at time of creation becomes less than the cost of ignoring it until it blows up.

          I doubt it'll rise so much that we'll want to face the cost of behaviour proofs for much code at all, but it's quite possible it'll rise enough that we want to do things like prove that indices are in bounds, at compile time, so vector accesses can skip checks without compromising safety.

        • crote 55 minutes ago
          I fear it'll just move the problem one layer up. Sure, you've now proven that the code matches the specification - but how do you ensure the specification is watertight?
  • david_shaw 3 hours ago
    At risk of quoting too much of the article, it opens with this:

    > A requirement for staying sane while working in public as an open source maintainer is realizing that every issue, PR, and piece of feedback is a present, not an obligation. You can accept it, ignore it, and use it partially or not at all.

    > Except…

    > For years, as lead of the Go Security team at the time, I’ve told new team members that it doesn’t apply to vulnerability reports. No, vulnerability reports are special. Security researchers are doing us a favor by reporting things confidentially instead of doing full disclosure, so we owe them something, which is not true of regular issues opened on the issue tracker.

    [...]

    > It’s 2026 and none of the premises are true anymore.

    I respectfully disagree.

    The premise is absolutely still true: if someone discovers a critical, exploitable vulnerability in your software, the impact and tradeoffs are exactly the same as they were before LLMs started finding bugs. There are just more of them now, so they're easier to come by.

    But that won't last forever, either. As LLMs find increasingly difficult-to-find vulnerabilities, there will be fewer of them to report. This is just chugging through the backlog.

    All of that said, I don't think finding vulnerabilities has really been the difficult security problem for most companies (or open source projects). The difficult problem is dedicating resources to fixing those vulnerabilities instead of building software, products, and/or infrastructure that people want. That problem is absolutely still here today, but I'm optimistic that agentic security developers will be able to take the burden off of development teams in the near future.

    For tokens, of course.

    • CJefferson 2 hours ago
      The problem is there used to be a fairly high correlation between ‘security report’ and ‘real vulnerability’. Not perfect but good enough. Now the two are almost entirely disconnected.
    • appplication 1 hour ago
      > But that won't last forever, either. As LLMs find increasingly difficult-to-find vulnerabilities, there will be fewer of them to report. This is just chugging through the backlog.

      I think your logic is partly correct but the fact that the same LLMs are allowing an exponential increase in insecure code generated is a counterbalancing point. I do not think this phenomena will slow down.

      • sneak 1 hour ago
        Nah, those same LLMs, if prompted correctly, will be able to do an audit pass and a fix pass on that LLM-generated code. It’s a tooling issue that will get fixed in time.
    • shakna 1 hour ago
      > But that won't last forever, either. As LLMs find increasingly difficult-to-find vulnerabilities, there will be fewer of them to report.

      That is not my experience at all. People will continue to high-volume spam intended behaviour as if it is a bug.

      There will be fewer reports that matter as you fix things - but the volume of reports will either stay steady or go up. Making it harder to even notice the ones that matter.

      • jcgrillo 53 minutes ago
        The problem always existed, but nobody amassed a sufficiently large army of trolls to exploit it until now. So it wasn't a priority to solve it before, but now it is. We're going to have to learn to differentiate reports that matter from those that don't. Classifying reports might actually be something you could productively use an LLM for..
        • shakna 22 minutes ago
          When we solved this problem for email... We just dumped everything similar to untrusted into the rubbish bin. Important things can vanish too, and that's an acceptable price.
    • cpuguy83 1 hour ago
      It's not (just) more of them, it's the same ones reported by multiple people.

      I think the point is those issues are now easily discoverable and are nearly public because of it.

  • bawolff 2 hours ago
    There are some problems with incentives in the vuln report space. People report trivial vulns and expect the same treatment as people reporting critical vulns. But this isn't new with AI. Look at all the ReDos vulns in npm ecosystem. Its questionable if its a vuln in general but half of them aren't even triggerable.
  • fastball 1 hour ago
    They weren't special even before LLMs. Drive-by script-kiddies would run some basic scripts against your platform and send generally-not-actually-a-vulnerability reports, claiming that these were big problems, and requesting to be paid bug bounties.
  • jerrythegerbil 2 hours ago
    Vulnerability reports were never special.

    The _demonstration_ of security impact through vulnerability reports was special. The automation of “demonstration of impact” with AI isn’t that at all. The last mile is human and always was. This isn’t to say it won’t change in the future, but that’s a fact of where we are now.

    Vulnerability reports aren’t special anymore. They never were. It was the impact, the demonstration, the communication that was special.

    When you realize that this is being written from the perspective of someone who does vulnerability reporting in a professional capacity, you’ll connect the dots. We took care to be kind and succinct because for many of us, we learned our skills from being on the development side of things first.

    Vulnerability reports aren’t special anymore. The only ones that felt special were the ones with human touch, the ones doing their job as an adversarial thinker, and taking the care to understand that net positive outcomes require coordination even if both parties don’t see eye to eye.

    Nothing has changed. It never was. You’re just inundated with AI slop; which as a practitioner who uses AI regularly I can say with absolute confidence. The end result is the same, the volume is increased, but the special thing was never the report itself.

    Finding a vulnerability was always the easy but high toil part. It was the care to communicate succinctly and be invested in the outcome that was special.

    Godspeed.

    • ofjcihen 1 hour ago
      This x 1000

      I’ve been screaming this from the rooftops. Impact is what was always important. No one is going to take down prod to do an emergency patch on an RCE that COULD NEVER ACTUALLY BE EXPLOITED.

      I feel like we’re witnessing the result of multiple roles suddenly becoming security aware but not having the background or understanding to make any sense of it.

      • cpuguy83 1 hour ago
        In an ideal universe yes. But we live in a world where vulnerability scanners reign supreme.
        • jamesfinlayson 43 minutes ago
          Yep, I've updated dependencies with an RCE that can't be exploited in my codebase just to keep my security team happy. Not worth the multiple arguments about it not actually being an issue.
  • agolio 2 hours ago
    Tangent point, I think more broadly this is a big piece of AI-cynicism in general- “x isn’t special anymore”.

    It’s tough staying motivated on a craft when an AI is nearly as good as you. Chess players manage to do it at least.

    • Avicebron 2 hours ago
      > Chess players manage to do it at least.

      The 5 on earth still getting paid to play chess?

      • fragmede 1 hour ago
        There's only one Magnus Carlsen, who earned > $1 million in 2025 for playing chess, but the long tail, there were 26 people who made more than $100k, https://thechessworld.com/articles/general-information/the-1...

        but like, if you mean literally "someone gave them money and they played a game of chess", the number becomes much bigger. Chess coaches, streamers, club instructors, exhibition players, league players, camp counselors, and titled players receiving appearance fees, etc. All told, you're looking at ten's of thousands across the world.

    • z0ltan 1 hour ago
      [dead]
  • woodruffw 3 hours ago
    I agree with this. One of the consequences of the "vulnpocalpyse" is that it's become even harder to sift through the noise: I triage well over a dozen reports a week, many of which are "real" in the sense that they reflect a genuine defect but otherwise have an unclear impact on a typical user. This has always been true of the median vulnerability report, but the volume means that I now lean much more heavily away from coordinated disclosure.

    One flipside to this is that, because many of these bugs are "shallow" to LLMs, it's actually easier than ever to moderate the worst participants in your vulnerability program -- if someone sends you slop, you can just ban them and wait for the next, better orchestrated LLM to send you a better report for the same vulnerability.

    • notnmeyer 3 hours ago
      this is hilarious and i might try it.
  • skybrian 2 hours ago
    I'm wondering whether this is a permanent change. After all the easy-to-find bugs have fixed and you can't find them just by asking an AI, perhaps security issues will deserve special treatment again.
  • jamesjhare 48 minutes ago
    "LLMs are as good as almost any security researcher"

    No they are not. Everything else can be safely ignored. The author is suffering from AI psychosis and needs to get some help.

  • sans_souse 1 hour ago
    I'm curious about people's experiences with Kalshi support in this context.
  • naturalmovement 1 hour ago
    Linus Torvalds once went on record saying security vulnerabilities are no more important than regular bugs.

    This of course made vulnerability researchers seethe worse than aggrieved Redditors.

    It turns out he was right all along.

    The author also gets it wrong by assuming that regular bug reporters are not "providing a service". They are.

    When I wrote up a bug report, I made sure it's thorough with detailed steps to reproduce. It takes a lot of time and I've done it professionally for projects you've absolutely heard of.

    Having said that, getting them ignored repeatedly and — even worse — having my detailed PRs rejected, sometimes within minutes, as if I'm some ignorant luser is why I don't do it anymore. My time is more valuable than your hubris.

    A lot of open source developers have their heads so far up their own asses they forgot that it takes a community for projects to be successful.

  • enraged_camel 1 hour ago
    >> A requirement for staying sane while working in public as an open source maintainer is realizing that every issue, PR, and piece of feedback is a present, not an obligation.

    I don't think the gift analogy works well. In most cultures, turning down or even ignoring a gift is considered anywhere from impolite to hugely offensive. But that's the opposite of open source: there's nothing wrong with requesting changes to a PR or even closing it.

    • cpuguy83 1 hour ago
      Plenty of people offended by closing a PR or issue unresolved.
  • zeveb 3 hours ago
    > If a security vulnerability is reported by someone who is also violating the CoC, what do you do? Do you ignore it? Fix it silently?

    Is this even a question? You triage and fix the vulnerability just like any other one. Are truths spoken by folks one dislikes — even for perfectly valid reasons — any less true?

    The only way I can imagine this somehow applying is if someone has a habit of reporting vulnerabilities which do not exist, or of exaggerating their severity. Is crying wolf a CoC violation? If so, then I can imagine that particular sort of bad behaviour justifying some consideration before acting on a report.

    • fragmede 2 hours ago
      How badly are they violating the code of conduct? It wouldn't be the first time a security researcher got thrown into prison or jail, in this line of work.
    • calvinmorrison 3 hours ago
      Will xorg backport patches from Xlibre?
      • inigyou 49 minutes ago
        No, because xorg is a dead project that doesn't take any patches from anywhere and xlibre has shit code quality and is probably vibecoded now
  • thoangai 1 hour ago
    [flagged]