RL environment + GRPO-trained overseer that detects hallucination propagation in multi-agent LLM fleets 5-signal composite reward, 4 task tiers, 112 tests
-
Updated
Jun 24, 2026 - Jupyter Notebook
RL environment + GRPO-trained overseer that detects hallucination propagation in multi-agent LLM fleets 5-signal composite reward, 4 task tiers, 112 tests
LUMINA-30: non-binding boundary framework for preserving effective human refusal before irreversible AI consequences.
Telemetry-grounded, calibrated failure attribution for agent oversight (OTel GenAI + Who&When).
Lightweight proof-of-concept for oversight-centered metrology in coding agents: workflow-aware evaluation, interrupt channels, and claim-margin reporting beyond raw success scores.
Sentinel — embeddable inline AI oversight. Wraps the host AI's output in place so the human expert can validate, correct, and audit without leaving their workflow.
Reflex-interruption path verifier for AI actions — maps risk events to tactile signal classes and races body latency against commit delay
Add a description, image, and links to the ai-oversight topic page so that developers can more easily learn about it.
To associate your repository with the ai-oversight topic, visit your repo's landing page and select "manage topics."