- • OpenAI-compatible (chat/completions)
- • SSE streaming tokens + tool-calls
- • auto conv_id (L2 persistence)
- • clients: OpenCode, Cursor, aider, ClawCode
- • All persistent data in PG
- • Pool memory = cache
- • pgvector for RAG/L3
- • Versioned migrations 001→019+
- • Deficit scoring (GPU/CPU fairness)
- • "Always answer" doctrine
- • No outbound 503
- • Auto-exclude unknown_model
- • Auto-review (1s, from phase 1)
- • Cross-pool forward (M7a)
- • Multi-role pipeline (N-pool scale)
- • SSE exposes review to clients
- • L2 conversations Fernet encrypted
- • User opt-in (toggle)
- • GDPR export + delete
- • Pool admin cannot read content
- • RED Qwen3-30B-A3B (admin)
- • Coder · Tank · Scout (dedicated)
- • RED daemon 3h + self_update
- • 4 active workers today