CIS490

History

Max Gorog 4ab5477226 PIPELINE §5 step 1: fix four root-cause defects Diagnoses + fixes for the silent-collector / never-lands-session failures that the 200-episode quality probe surfaced (§3 evidence). All four address the producer; no compensating layers added. perf collector (rows_perf=0 on 100% of episodes): - perf stat -j writes to stderr by default with -p; we read stdout. Add --log-fd 1 so JSON reaches stdout where the parser sees it. - Event names come back annotated with the privilege scope perf actually measured ("cycles:u" under perf_event_paranoid=2). Strip the suffix so _build_row's plain-name lookups hit. Without this every metric was None even when perf reported real numbers. - tests/test_collectors_emit.py covers the regression with a real busy-loop fixture; emit-test discipline per §4.4. guest-agent collector (rows_guest=0 on 100% of episodes): - Alpine cloud image doesn't ship python3, so the in-guest agent's `#!/usr/bin/env python3` shebang silently fails. Add packages: [python3] to cidata user-data so cloud-init installs it before the OpenRC service starts. - Guest agent now exits nonzero (was: silent stdout fallback) when /dev/virtio-ports/cis490.guest.agent is missing, so OpenRC reports the failure to /var/log/cis490-agent.log instead of the bytes vanishing into the void. Refs §1. - Host-side collector emits guest_agent_connected / guest_agent_first_byte / guest_agent_silent_window into the orchestrator's events.jsonl. Future episodes show the in-guest failure mode per-episode instead of inferring from rows_guest=0. k-gamingcom missing qmp/netflow/pcap (also affected elliott on Tier-3 episodes — was misclassified as host divergence): - tools/run_tier3_demo.py was building EpisodeConfig WITHOUT qmp_socket / guest_agent_socket / bridge_iface — even though launch_target.sh creates the underlying chardevs and BRIDGE supplies the iface. tools/run_real_vm_demo.py wires them correctly; Tier-3 had a copy-paste gap. - tests/test_collectors_emit.py adds a source-grep regression so the wiring stays honest. samba_usermap_script never lands session (0/67 in §3 probe): - Bind handler default WfsDelay (~5s) gives up before bind_perl on Metasploitable2 has finished forking + binding LPORT under SLIRP+hostfwd. Bump to 30s; matches session_open_timeout_s in exploits/driver.py so framework + driver agree on the wait budget. Add ConnectTimeout=15 so the handler's bind connect has retry budget instead of one-shot. orchestrator/fleet.py: usable_modules + BRIDGE handling were both unconditional, so: - With BRIDGE set, requires_bridge modules were still being dropped — picker only ever returned samba_usermap_script across every slot/episode (the test_fleet_uses_all_modules_when_bridge_set failure on HEAD). - env.pop("BRIDGE") fired even when BRIDGE was the operator's explicit setup, breaking modules that need bridge mode (vsftpd backdoor on hardcoded port 6200, distccd, etc.). Both made conditional on bridge_set so the picker walks the full catalog under bridge mode and SLIRP-only modules still get a clean SLIRP env when BRIDGE is unset. receiver/app.py: half-pregnant v2 schema state in HEAD — calling store.ingest_stream(episode_type=..., benign_profile=...) with kwargs the matching store.py change was in the WIP stash. Removed v2 awareness from app.py so v1 episodes (what the producer ships today) get accepted again. SCHEMA_VERSION default reset to 1 to match. 229 passed, 0 failed. (HEAD had 15 failures, all linked to the half-pregnant v2 state above.) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>		2026-05-03 17:05:25 -05:00
..
__init__.py	Add receiver: PUT /v1/episodes ingest with sha256 verify and idempotency	2026-04-28 23:34:04 -06:00
test_auto_fetch_samples.py	auto_fetch_samples: pick Linux i386 ELF; manifest matches theZoo	2026-05-01 03:28:26 -05:00
test_collectors_emit.py	PIPELINE §5 step 1: fix four root-cause defects	2026-05-03 17:05:25 -05:00
test_doctor_shipping.py	shipper: systemd watchdog, quarantine cleanup; doctor surfaces ship errors	2026-05-01 12:02:59 -05:00
test_episode.py	meta.json: stamp code_version (commit, branch, dirty) per episode	2026-05-01 01:29:01 -05:00
test_exploits.py	fleet: rotate exploit modules per (host, slot, ep); Tier 3 by default	2026-04-30 02:22:49 -05:00
test_fleet.py	Solvable Tier-3 holes: callback payloads, busybox workloads, bridge by default	2026-04-30 02:32:52 -05:00
test_fleet_health.py	fleet-health: exit 0 when alerts found (don't mark unit failed)	2026-05-02 13:51:20 -05:00
test_guest_agent.py	Collectors 2/4/5 + fleet runner + sample manifest + Tier-3 setup scripts	2026-04-30 00:02:27 -05:00
test_host_health.py	fleet-health: proactive alerts on the Pi + per-host doctor reports	2026-05-02 13:48:31 -05:00
test_pcap.py	Collectors 2/4/5 + fleet runner + sample manifest + Tier-3 setup scripts	2026-04-30 00:02:27 -05:00
test_perf_qemu.py	Close out the open issues: bridge pcap wiring, perf collector, Tier-4	2026-04-30 00:17:49 -05:00
test_proc_qemu.py	Add v0 orchestrator + first oracle collector (host /proc)	2026-04-28 23:40:25 -06:00
test_prune.py	Multi-signal prune classifier: rescue valid episodes /proc misses	2026-04-30 19:10:01 -05:00
test_qmp.py	Close out the deployment-readiness gaps	2026-04-30 00:31:55 -05:00
test_quarantine_unstamped.py	fix: lab-host install loop after commit-gate cutover	2026-05-01 11:36:21 -05:00
test_receiver.py	Add receiver: PUT /v1/episodes ingest with sha256 verify and idempotency	2026-04-28 23:34:04 -06:00
test_shipper.py	shipper: systemd watchdog, quarantine cleanup; doctor surfaces ship errors	2026-05-01 12:02:59 -05:00
test_tier3_local_verify.py	tools/verify_tier3_local.py: Pi-runnable Tier-3 verifier	2026-05-01 03:41:21 -05:00
test_tier4.py	Close out the deployment-readiness gaps	2026-04-30 00:31:55 -05:00
test_ulid.py	Add v0 orchestrator + first oracle collector (host /proc)	2026-04-28 23:40:25 -06:00
test_version_gate.py	robustness: gate falls back to local git, queue sweeps stale tarballs	2026-05-01 11:49:38 -05:00
test_vm_load_controller.py	workload audit trail: meta.sample + per-phase events + pre-kill probe	2026-04-30 02:12:34 -05:00