Non-parametric baseline alongside GBT/MLP/CNN/GRU/LSTM/Transformer. Same BaseModel + schema-hashed checkpoint contract; sidecar is a pickled sklearn KNeighborsClassifier (.knn.pkl) handled by the existing checkpoint machinery alongside .xgb.json / .pt. KNN's storage cost = n_train_rows × n_kept_features × 4 bytes. At 660k windows × 145 kept (realistic mode) features = ~380 MB sidecar; at 230 features (oracle) = ~600 MB. Heavy but ships through the same artifact-upload path. trainer/run.py learns a third fit branch: - GBT — XGBoost early stopping on val mlogloss - KNN — fit() memorizes; "training time" is val/test predict cost - NN — train_nn loop (the rest) Manifest gains knn-realistic + knn-oracle at priority 95 (just below GBT). KNN's k=10 default lives in the model class — overriding via hyper.k requires adding --k to run.py first to avoid the unknown-arg exit-2 issue. Smoke verified on the 567-episode subset: knn oracle val=0.7365 test=0.1333 (held-out k-gamingcom) That val/test gap (0.74 → 0.13) is the cross-device generalization story: KNN memorizes elliott-thinkpad's local feature space and falls apart on the other host. Honest baseline for the comparison report. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| caddy-root.crt | ||
| cis490-autoupdate.service | ||
| cis490-autoupdate.timer | ||
| cis490-bootstrap.service | ||
| cis490-cert-fetch.service | ||
| cis490-cert-fetch.timer | ||
| cis490-dashboard.service | ||
| cis490-doctor-check.service | ||
| cis490-doctor-check.timer | ||
| cis490-fleet-health.service | ||
| cis490-fleet-health.timer | ||
| cis490-orchestrator.service | ||
| cis490-receiver.service | ||
| cis490-shipper.service | ||
| cis490-trainer-receiver.service | ||
| cis490-trainer-worker.service | ||
| lab-host.toml.example | ||
| README.md | ||
| receiver.toml.example | ||
| training_manifest.toml.example | ||
etc/
Templates for system-level files installed by scripts/install-*.sh:
cis490-receiver.service— systemd unit for the receivercis490-dashboard.service— systemd unit for the dashboard.wg live displayreceiver.toml.example— config template for the receivercis490-orchestrator.service(TODO) — systemd unit for the orchestratorcis490-shipper.service(TODO) — systemd unit for the shipperlab-host.toml.example(TODO) — config template for the lab host
See docs/deploy.md for the install flow.