A–W Methods & Evidence

Plain-English description of how we measure decision advantage, what “Δ in pp” means, and how your team can reproduce results offline.

Download GovEval Kit (ZIP) Evaluator details & hash

Key definitions

Study design (surrogate, non-kinetic)

We evaluate in a laser-tag–style training surrogate (no weapons, no kinetic modeling). For each scenario (terrain, weather, EW conditions, force ratios):

  1. Run Baseline.
  2. Run A–W under the same seed and conditions.
  3. Record WIN/LOSS and round-level attributes to CSV.
  4. Repeat thousands of times; compute WR for each arm and Δ in pp with 95% CIs.

Because both arms use identical seeds, differences isolate the contribution of A–W.

Headline acceptance gates

What actually improves (human-in/on-the-loop)

For Government Evaluators

Kit Version
GovEval_AW_Kit_v1r1
SHA‑256 (zip)
f472522f2bd87c29b03a52a4d719ed5dda3482d50d615e139e449740f9a31a86
File size
1.6 MB
Last Updated (UTC)
2025-09-08 20:55Z
Contact
[email protected]

If your policy forbids web downloads, reply by email to arrange encrypted, couriered media.

Government-run replication (air-gapped)

We provide a turnkey kit so your lab can re-run the study offline:

Outcome: either the surrogate deltas replicate within CI on your hardware, or they don’t. The kit is designed to make that determination quickly and cleanly.

Integration (no rip-and-replace)

A–W runs as a sidecar next to your existing C2/mission tools:

Contact

For kit access, briefings, or licensing discussions: [email protected]