NODE720 — SAST · proven on real-world npm code

01 — THE TRIAL

Real projects. Adversarial triage.

A SAST scanner only earns trust on code it has never seen. We froze an engine snapshot and ran node720 scan over the latest published source of 22 of the most-depended-on npm packages — roughly 250,000 lines. Then a 12-agent triage pass plus a manual root-cause review read the actual flagged source for every finding and labelled each one true-positive, false-positive, or conditional.

Ten of these packages are so heavily audited that the correct answer is nothing. node720-static returned exactly that — zero findings, zero noise — on all ten:

axiosexpressws semverjs-yamlglob uuidchalkdebug body-parser

02 — THE HONEST RESULT

Then it cried wolf — twice.

node720-static keeps two tiers strictly separate: a confirmed tier (Class A/B-high) that is meant to be safe to fail a CI build on, and an informational C tier (reachability hints, never a headline). The flagship promise is zero false positives at the confirmed tier.

On this set, that promise failed. The confirmed tier fired twice — once on pug, once on commander — and both were false positives. Either one would have broken a popular, CVE-free package's build on intended behaviour. The C tier, meanwhile, was ~89% false-positive and found no true bugs. We are showing you the failure because the fix is the product.

Package	Confirmed	C-tier (info)	Verdict on every finding
pug	1	4	FP (template compile) + conditional reachability
commander	1	0	FP (`spawn(process.argv[0])`)
moment	0	30	FP — proto reads + minified-bundle duplicates
qs	0	13	FP — guarded merge + cross-file over-fire
cheerio	0	9	FP — DOM `.find()` read as a Mongo query
ejs · lodash	0	10	FP — flagged the library's own mitigations
handlebars · async · minimist · node-fetch	0	6	FP + conditional (node-fetch SSRF)
TOTAL · 22 pkgs	2	72	0 true positives · 0 crashes

// Tiers are reported separately on purpose. The confirmed tier is the only one sold as 0-FP; C is explicitly noisy reachability and must never blend into the headline number. Both numbers above are pre-fix. §05 is the after.

03 — ANATOMY · WHAT EXACTLY WENT WRONG

Two confirmed findings, read line by line.

Both trace to one unsound assumption: that a library's own API parameters are attacker-controlled. Scanned in isolation, a library has no HTTP request, no CLI argv — so seeding taint from its exported parameters manufactures "untrusted" data that does not exist.

pug lib/index.js:288 · class code · CWE-94

False positive → re-tiered

// pug compiles the developer's OWN template into a function. // templateName is hardcoded to 'template' (line 280). return new Function('', parsed.body + ';return template;')() // ▲ // parsed.body is JS that PUG ITSELF generated from the // template source — runtime locals never flow here.

Why it is not a confirmed bug

This is the compile-and-eval step every template engine performs. The "source" was options.filename / the template — a library API parameter, not a proven untrusted boundary.

Template→Function is the SSTI/RCE class — but only if the application compiles an untrusted template. You cannot statically confirm that against pug in isolation.

Legitimate as a C-tier "dangerous if the template is untrusted" note. Unjustifiable as a build-failing finding.

commander lib/command.js:1286 · class command · CWE-78

False positive → suppressed

// launching a JS subcommand with the Node binary itself proc = childProcess.spawn(process.argv[0], args, { stdio: 'inherit' }) // ▲ // argv[0] === process.execPath — the Node.js interpreter // path, set by the runtime. shell:true is NOT passed.

Why it is not a confirmed bug

The "tainted program" is process.argv[0] — the Node interpreter path. An attacker controls argv[1..], never argv[0].

No shell:true, so args cannot inject a shell command, and the spawned program is node itself — the standard mechanism for launching a subcommand.

The tracker folded a taint marker into process.argv and treated argv[0] as input. It is not. Wrong even in an app context.

04 — THE ROOT CAUSE & THE FIX

A trust-source model, plus five hardenings.

The root fix (P0) is a trust-source model: taint originates only at a genuine untrusted boundary — an HTTP request member, CLI argv[2+], a file/network read, or an explicit taint declaration. A flow whose only origin is an exported API parameter may still surface at C (reachability) but can never mint a confirmed verdict. The param-as-source recall lever stays — it just can't fail your build alone. Five detector fixes clear the C-tier noise.

P0

Trust-source model

Confirmed tier requires a real boundary source. A bare param-seed flow is capped at C. argv[0]/argv[1] (node + script path) excluded from CLI sources. Fixes both confirmed FPs.

2 → 0

confirmed

P1

proto: read/write split + mitigation-aware

A bracket read can't pollute a prototype — suppressed unless the key is request-sourced. Recursive deep-merge is the gadget; a guarded/shallow target[k]=src[k] copy (handlebars, ejs, qs) is cleared.

~36

P2

cross-file summary confined to the function

The param→sink scan no longer matches an unrelated sink elsewhere in the module, and skips soft method-name collisions (RegExp.exec, TypedArray.set) on benign first-party helpers.

~19

P3

skip minified / generated artifacts

*.min.js, *.bundle.js, dist/, and — added in round two — any file with a generator banner (js_of_ocaml, @generated, webpack) or a minified signature. Generated bundles aren't authored source; they re-report sinks and trip on runtime machinery.

~12

P4

type the NoSQL sink

A Mongo query is an object filter on a DB handle. A CSS-selector string on cheerio's .find() or an Array.find predicate no longer collides with mongo.query.

3

P5

mass-assignment: real source, honest words

Object.assign is flagged only when the source is a genuine untrusted boundary — not a dev-supplied config/callback. The fabricated "request body" wording is gone.

4

05 — AFTER · MEASURED, NOT CLAIMED

Confirmed tier back to true zero.

Re-scanning the locally reproducible subset of the corpus with the fixed engine: the build-failing tier no longer fires on a single audited library, and the C-tier noise collapses. The genuinely-useful conditional detections (pug/template, node-fetch/SSRF, traversal) survive — correctly — as honest C-tier reachability.

2 → 0

confirmed-tier
false positives

40 → 9

C-tier findings
(reproducible subset · −78%)

0

scan crashes
before & after

17

corpus tests now
gating every commit

// "Reproducible subset" = the corpus packages installed in the test environment (moment and node-fetch were not, so their pre-fix 32 C-tier findings are excluded from the −78% to keep the comparison apples-to-apples). The headline guarantee — 0 confirmed-tier FPs — is measured across the full installed corpus.

06 — EXPLOITABILITY PATHS · WHAT A REAL TRACE LOOKS LIKE

It still confirms the real ones.

Suppressing library-param noise does not blunt the engine on genuinely vulnerable code. Here is a real, non-obvious 2026 CVE node720-static confirms end-to-end — a tainted span that reaches the SQL sink past every value check — alongside an honest conditional trace that is correctly held at C.

@nocobase/database CVE-2026-41640 · class sql · CWE-89

Confirmed · build-failing

// attacker controls an association name; recursive eager- // loading concatenates it straight into SQL. app.get('/api/:table', (req, res) => { const appends = req.query.appends // untrusted source db.repository(req.params.table).find({ appends }) }) // …deep in the ORM, once per association: sql += ` LEFT JOIN ` + assoc + ` ON …` // SQL sink

The trace node720 reports

source · taintedreq.query.appends

propagates · object propertythrough { appends } into .find()

cross-file · ORM internalsrecursive eager-load, once per association name

sink · keyword positiontainted span becomes SQL structure, not a value → injection

node-fetch request URL · class ssrf · CWE-918

Conditional · held at C

// node-fetch's entire job is to fetch a URL it is given. const res = await fetch(url) // ▲ // `url` is a library API parameter. SSRF is real IFF the // APP passes an untrusted URL — not a node-fetch defect.

Why this stays informational

The SSRF shape is genuine, but the source is a library parameter, not a proven request boundary. Under the trust-source model this can reach C (reachability) — "SSRF if the app feeds an untrusted URL here" — but it can never become a confirmed, build-failing verdict.

That is the difference the trial taught us: a dangerous shape is a hint; a dangerous shape fed by a proven untrusted source is a finding.

// node720-static is measured at 91/100 on the held-out real-vuln corpus (reachability recall 90.7%, false-positive rate 0/64), and the same detector core runs byte-for-byte at runtime in node720-rasp. See How it works for an end-to-end RCE caught by both engines.

07 — THE STANDING GATE

The trial is now a test.

A one-off audit ages out. So the corpus is a re-runnable battery: it hard-fails CI on a single confirmed-tier finding on any audited library, and tracks each package's C-tier count against a recorded ceiling so a detector change cannot quietly regress precision.

# test/realworld-fp-corpus.test.js — runs on every commit

// HARD GATE — zero build-failing findings on audited libraries
✔ corpus: ZERO confirmed-tier findings on audited libraries
// TRACKED — C-tier reachability must not regress past its ceiling
✔ corpus: commander C-tier within ceiling (2)
✔ corpus: pug C-tier within ceiling (7)
✔ corpus: qs C-tier within ceiling (3)
✔ corpus: cheerio C-tier within ceiling (4)
✔ corpus: axios · express · semver · js-yaml · glob · debug · body-parser  (0)
   …  17 pass · 0 fail   // a confirmed FP on a clean lib turns this red

// The same discipline as the engine's existing fpRate 0/64 guarantee — now extended to real, audited, third-party code. Stability held throughout: 0 crashes, before and after.

08 — ROUNDS TWO THROUGH FOUR · THE GATE EARNS ITS KEEP

Then ten more. Twenty more. A hundred more.

A 0-FP claim is only as good as the next package it has never seen. So we kept going — ten, then twenty, then a hundred more, all at their current versions — to a standing corpus of 150 of the most-depended-on libraries, the dependency backbone of npm:

mongoosefastifykoazod undicigotsequelizemysql2 jsdommarkdown-itjsonwebtokenshelljs passportsanitize-htmlnode-forgebcrypt +134 more

0

confirmed-tier
false positives · all 150

0

scan crashes
across the corpus

150

real packages
validated in total

136 → 64

C-tier noise
cut by the round fixes

The confirmed tier stayed at zero the whole way — but each sweep did surface real precision gaps, and that is exactly what it is for. Round two caught xml2js: four confirmed-tier hits, all inside one 3.4 MB, 28,000-line file:

xml2js lib/xml2js.bc.js · a generated js_of_ocaml bundle

False positive → root-caused

//# Generated by js_of_ocaml — OCaml compiled to JS function caml_js_expr(s){ console.error("caml_js_expr: fallback to runtime evaluation") return eval(caml_jsstring_of_string(s)) // runtime primitive }

Why it fired — and the fix

The four hits were eval() calls inside the OCaml-to-JS runtime's own primitives (caml_js_expr, caml_js_eval_string…) — not authored application code, and not reachable by any web request.

The readable source (parser.js, builder.js) was clean. The bundle was the problem — so we taught the scanner to skip generated artifacts by content (a generator banner or a minified signature), not just by filename.

Result: 4 → 0. A real precision gap on real code, found by the sweep and closed the same day — then locked behind the standing gate so it can't come back.

zod · fastify · pino C-tier reachability · CWE-915 / CWE-829

Noisy informational → tightened

// internal config merges — NOT mass-assignment of request data options = Object.assign({}, options) // fastify: shallow copy opts = Object.assign({}, defaults, opts) // pino: merge defaults Object.assign(merged, descriptors) // zod: into a fresh {}

Twenty more libraries → two precision fixes

Round three's twenty packages produced no confirmed-tier hits, but a wave of C-tier Object.assign findings mislabelled internal config merges as mass-assignment. A merge into a fresh object is a clone, not privilege escalation.

So mass-assignment now fires only when the source is genuinely request-derived (preserving the real Object.assign(entity, req.body) exploit), and cross-file data-shape reachability is no longer double-counted across module boundaries.

Result: C-tier noise on the corpus cut from 136 to 64 — confirmed tier still zero.

round four · 100 more test payloads · build bundles

Noise → skipped

// confirmed hits that were NOT real vulnerabilities: object-path/test.js // the lib's OWN security test payloads nedb/out/nedb.js // a generated browserify bundle

A hundred more libraries → one more fix

The 100-package sweep's confirmed hits were a dependency's own test suite (intentional __proto__ payloads it asserts are blocked) and a build bundle — not code anyone deploys.

So the scanner now skips dependency test/benchmark trees and build-output dirs by default. The dependency backbone of npm, scanned clean.

Staying quiet on clean code is half the job. The other half: does it fire on the vulnerable one? Same library, two versions —

ejs CVE-2017-1000228 · code injection · CWE-94

True positive · vulnerable vs. patched

// ejs@2.5.7 — the attacker-controllable option is // interpolated into the compiled function's param list: fn = new Function(opts.localsName + ', escapeFn, …', src) // ▲ unsanitized → code injection // ejs@latest — localsName is validated against an // identifier regex first → node720 reports nothing.

It catches the real one — and only the real one

ejs@2.5.7 (the vulnerable version): 1 confirmed code-injection finding. ejs@latest (patched): 0. The same scan, the same rule — it fires on the defect and stays silent on the fix.

Beyond libraries, on a vulnerable application (DVNA) with real request→sink flows the confirmed tier lights up as designed. Audited code stays clean; vulnerable code does not.

// Across four rounds — 150 packages, the dependency backbone of the npm ecosystem — node720-static's confirmed tier fires zero false positives, with zero crashes, while still catching real, documented vulnerabilities in the versions that have them. Every gap each sweep exposed is a regression test today (threat-intel/realworld-corpus.js).

RUN IT ON YOUR CODE

Scan every line before it ships.

node720-static reuses the runtime detectors byte-for-byte to find source→sink flows — no runtime trigger required. SARIF or text output, guard-aware, inter-procedural and cross-file.

$ npx node720 scan ./src --format sarif $ npx node720 scan ./src # human-readable text

→ Full install & CI setup → Watch one detector think