Forge-CRS — Autonomous Cyber Reasoning System

Verification Report

← Back to outcome

Verification Report - Forge-CRS

Status: PASS | Checks passed: 37 / 37 (100%) | Seed: 1337 | Generated: 2026-06-25T20:43:25.678Z

What is verified

The CRS is run end-to-end, unattended, over the benchmark of seeded real-world vulnerability classes. For each target the verifier asserts the full autonomous loop succeeded: the bug was discovered by coverage-guided fuzzing of the *unpatched* code, classified to the correct CWE, reduced to a minimal PoV, patched by source rewrite, the PoV neutralized by that patch, and every functional regression case still passes. Global gates assert the whole pipeline closed every target, coverage guidance was live, and two identically-seeded runs are bit-for-bit identical.

Campaign summary

Discovered 5/5 · classified 5/5 · remediated 5/5 · wall-clock 4696ms.

TargetCWEStatusExecsPoV (min)PoV neutralizedRegression
config-mergeCWE-1321 Prototype PollutionREMEDIATED331B {"__proto__":{"polluted":true}}yes2/2
path-storeCWE-22 Path TraversalREMEDIATED1892B ..yes2/2
task-runnerCWE-78 OS Command InjectionREMEDIATED141B \nyes1/1
regex-validateCWE-1333 ReDoS (catastrophic backtracking)REMEDIATED4023B aaaaaaaaaaaaaaaaaaaaaa!yes4/4
binary-readerCWE-125 Out-of-bounds ReadREMEDIATED52B 0xff7fyes2/2

Checks

CheckDetailResult
config-merge: vulnerability discovered by fuzzing unpatched code3 executions, signal=PROTOTYPE_POLLUTIONPASS
config-merge: classified to ground-truth CWE-1321classifier said CWE-1321PASS
config-merge: PoV minimized to a tight reproducer53 -> 31 bytes [{"__proto__":{"polluted":true}}]PASS
config-merge: patch synthesized & appliedRefuse to merge the dangerous keys __proto__/constructor/prototype.PASS
config-merge: PoV neutralized by patch (hole closed)oracle no longer fires on PoVPASS
config-merge: functional regression preserved2/2 cases passPASS
config-merge: coverage guidance was live (V8 block coverage)coverageActive=true, blocks=11PASS
path-store: vulnerability discovered by fuzzing unpatched code189 executions, signal=PATH_TRAVERSALPASS
path-store: classified to ground-truth CWE-22classifier said CWE-22PASS
path-store: PoV minimized to a tight reproducer14 -> 2 bytes [..]PASS
path-store: patch synthesized & appliedResolve against the base and reject any path that escapes it.PASS
path-store: PoV neutralized by patch (hole closed)oracle no longer fires on PoVPASS
path-store: functional regression preserved2/2 cases passPASS
path-store: coverage guidance was live (V8 block coverage)coverageActive=true, blocks=2PASS
task-runner: vulnerability discovered by fuzzing unpatched code14 executions, signal=COMMAND_INJECTIONPASS
task-runner: classified to ground-truth CWE-78classifier said CWE-78PASS
task-runner: PoV minimized to a tight reproducer22 -> 1 bytes [\n]PASS
task-runner: patch synthesized & appliedPass the file name as an argv element instead of a shell string.PASS
task-runner: PoV neutralized by patch (hole closed)oracle no longer fires on PoVPASS
task-runner: functional regression preserved1/1 cases passPASS
task-runner: coverage guidance was live (V8 block coverage)coverageActive=true, blocks=2PASS
regex-validate: vulnerability discovered by fuzzing unpatched code40 executions, signal=REDOS_HANGPASS
regex-validate: classified to ground-truth CWE-1333classifier said CWE-1333PASS
regex-validate: PoV minimized to a tight reproducer31 -> 23 bytes [aaaaaaaaaaaaaaaaaaaaaa!]PASS
regex-validate: patch synthesized & appliedReplace the nested quantifier with an equivalent linear pattern.PASS
regex-validate: PoV neutralized by patch (hole closed)oracle no longer fires on PoVPASS
regex-validate: functional regression preserved4/4 cases passPASS
binary-reader: vulnerability discovered by fuzzing unpatched code5 executions, signal=OUT_OF_BOUNDS_READPASS
binary-reader: classified to ground-truth CWE-125classifier said CWE-125PASS
binary-reader: PoV minimized to a tight reproducer6 -> 2 bytes [0xff7f]PASS
binary-reader: patch synthesized & appliedClamp the declared count to the bytes actually present.PASS
binary-reader: PoV neutralized by patch (hole closed)oracle no longer fires on PoVPASS
binary-reader: functional regression preserved2/2 cases passPASS
binary-reader: coverage guidance was live (V8 block coverage)coverageActive=true, blocks=3PASS
Full pipeline remediated every target5/5 REMEDIATED in 4696msPASS
Deterministic: identical seed -> identical outcomesrun1 == run2PASS
No regression failures across the campaign0 regression failure(s)PASS