CIS490

max 05bf785f0a fleet-health: exit 0 when alerts found (don't mark unit failed)

The detector previously returned 1 on alerts, which made systemd mark cis490-fleet-health.service as 'failed' every tick that found a sick host. That's the wrong UX: a detector finding a fault is working correctly, not crashing. The alert is the signal (via WARNING log + alerts.jsonl); the unit's success state should mean "the detector itself ran cleanly." Test added.

Caught while live-deploying on the Pi: the first run found elliott-thinkpad fatal-only at 943×4xx + 1425×5xx and correctly emitted the alert, but systemd showed the unit red, which would have caused operators to chase the wrong tail.

Side note: the same first run also caught a real bug. The pycache for receiver.store on /opt/cis490 was stale after I deployed the new app.py + store.py from main, causing 1464 × 500 responses. Cleared the pycache and the index immediately resumed growing (4465 → 4515 in 30 seconds). The detector earned its keep on the very first cycle.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-02 13:51:20 -05:00
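
The exit-code policy described in this commit can be sketched roughly as follows. This is an illustration only, not the repository's code: the function names, the per-host stats shape (`ok`, `count_4xx`, `count_5xx`), and the "no successes but some errors" alert rule are all assumptions made for the example. Only the idea that alerts go to a WARNING log and an alerts.jsonl file while the process still exits 0 comes from the commit message.

```python
import json
import logging

log = logging.getLogger("fleet_health_sketch")  # hypothetical logger name


def run_detector(hosts, alerts_path):
    """Scan per-host counters and record an alert for each sick host.

    `hosts` maps host name -> {"ok": int, "count_4xx": int, "count_5xx": int}.
    This shape and the alert rule below are assumptions for illustration.
    """
    alerts = []
    for name, stats in hosts.items():
        errors = stats["count_4xx"] + stats["count_5xx"]
        # Assumed rule: a host with errors and no successful requests is sick.
        if stats["ok"] == 0 and errors > 0:
            alerts.append({"host": name, "4xx": stats["count_4xx"],
                           "5xx": stats["count_5xx"]})

    # The alert itself is the signal: one JSON line per alert plus a WARNING.
    with open(alerts_path, "a", encoding="utf-8") as fh:
        for alert in alerts:
            fh.write(json.dumps(alert) + "\n")
            log.warning("sick host: %s", alert)
    return alerts


def main(hosts, alerts_path):
    """Return the process exit code.

    0 means the detector ran cleanly, whether or not it found alerts.
    1 is reserved for the detector itself failing.
    """
    try:
        run_detector(hosts, alerts_path)
    except Exception:
        log.exception("detector failed to run")
        return 1
    return 0
```

An entry point would pass the result to `sys.exit(main(hosts, alerts_path))`, so systemd sees a nonzero status only when the detector crashed or could not write its output, not when it did its job and found a sick host.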
__init__.py	Add receiver: PUT /v1/episodes ingest with sha256 verify and idempotency	2026-04-28 23:34:04 -06:00
test_auto_fetch_samples.py	auto_fetch_samples: pick Linux i386 ELF; manifest matches theZoo	2026-05-01 03:28:26 -05:00
test_doctor_shipping.py	shipper: systemd watchdog, quarantine cleanup; doctor surfaces ship errors	2026-05-01 12:02:59 -05:00
test_episode.py	meta.json: stamp code_version (commit, branch, dirty) per episode	2026-05-01 01:29:01 -05:00
test_exploits.py	fleet: rotate exploit modules per (host, slot, ep); Tier 3 by default	2026-04-30 02:22:49 -05:00
test_fleet.py	Solvable Tier-3 holes: callback payloads, busybox workloads, bridge by default	2026-04-30 02:32:52 -05:00
test_fleet_health.py	fleet-health: exit 0 when alerts found (don't mark unit failed)	2026-05-02 13:51:20 -05:00
test_guest_agent.py	Collectors 2/4/5 + fleet runner + sample manifest + Tier-3 setup scripts	2026-04-30 00:02:27 -05:00
test_host_health.py	fleet-health: proactive alerts on the Pi + per-host doctor reports	2026-05-02 13:48:31 -05:00
test_pcap.py	Collectors 2/4/5 + fleet runner + sample manifest + Tier-3 setup scripts	2026-04-30 00:02:27 -05:00
test_perf_qemu.py	Close out the open issues: bridge pcap wiring, perf collector, Tier-4	2026-04-30 00:17:49 -05:00
test_proc_qemu.py	Add v0 orchestrator + first oracle collector (host /proc)	2026-04-28 23:40:25 -06:00
test_prune.py	Multi-signal prune classifier: rescue valid episodes /proc misses	2026-04-30 19:10:01 -05:00
test_qmp.py	Close out the deployment-readiness gaps	2026-04-30 00:31:55 -05:00
test_quarantine_unstamped.py	fix: lab-host install loop after commit-gate cutover	2026-05-01 11:36:21 -05:00
test_receiver.py	Add receiver: PUT /v1/episodes ingest with sha256 verify and idempotency	2026-04-28 23:34:04 -06:00
test_shipper.py	shipper: systemd watchdog, quarantine cleanup; doctor surfaces ship errors	2026-05-01 12:02:59 -05:00
test_tier3_local_verify.py	tools/verify_tier3_local.py: Pi-runnable Tier-3 verifier	2026-05-01 03:41:21 -05:00
test_tier4.py	Close out the deployment-readiness gaps	2026-04-30 00:31:55 -05:00
test_ulid.py	Add v0 orchestrator + first oracle collector (host /proc)	2026-04-28 23:40:25 -06:00
test_version_gate.py	robustness: gate falls back to local git, queue sweeps stale tarballs	2026-05-01 11:49:38 -05:00
test_vm_load_controller.py	workload audit trail: meta.sample + per-phase events + pre-kill probe	2026-04-30 02:12:34 -05:00