Components
Gateway API: FastAPI service · request handling, safety rails, postproc
Inference (Ollama): Model serving · Llama 3.2 3B under TTC constitution
Knowledge base: BM25 grounding · curated TTC corpus
Circuit breaker: Failure isolation around inference
Streaming endpoint: SSE /api/chat/stream
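The circuit breaker entry above only says it isolates failures around inference; the source does not show how it is built. The following is a minimal sketch of the general pattern, not the project's actual code. The class name `CircuitBreaker`, the `failure_threshold` and `reset_timeout` parameters, and the injected `clock` are all assumptions made for illustration.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


class CircuitOpenError(RuntimeError):
    """Raised when a call is rejected because the breaker is open."""


class CircuitBreaker:
    """Hypothetical breaker: closed -> open after N failures -> half-open after a timeout."""

    def __init__(self, failure_threshold: int = 3, reset_timeout: float = 30.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.failure_threshold = failure_threshold  # assumed default, not from the source
        self.reset_timeout = reset_timeout          # assumed default, not from the source
        self._clock = clock
        self._failures = 0
        self._opened_at: Optional[float] = None

    @property
    def state(self) -> str:
        if self._opened_at is None:
            return "closed"
        if self._clock() - self._opened_at >= self.reset_timeout:
            return "half-open"  # one trial call is allowed through
        return "open"

    def call(self, fn: Callable[[], T]) -> T:
        if self.state == "open":
            # Fail fast instead of waiting on a backend that is known to be unhealthy.
            raise CircuitOpenError("inference backend temporarily disabled")
        try:
            result = fn()
        except Exception:
            self._failures += 1
            # Trip when the threshold is reached, or re-trip if a half-open trial failed.
            if self._opened_at is not None or self._failures >= self.failure_threshold:
                self._opened_at = self._clock()
            raise
        # Any success closes the breaker and clears the failure count.
        self._failures = 0
        self._opened_at = None
        return result
```

In a gateway like the one listed, the call to the model server would be wrapped in `breaker.call(...)`, and `CircuitOpenError` would be mapped to a fast fallback response rather than a long timeout.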
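The knowledge base entry names BM25 as the grounding method without further detail. As a rough illustration of how BM25 ranks passages against a query, here is a self-contained sketch using the common Okapi formulation with a smoothed, non-negative IDF. The function name `bm25_scores`, the defaults `k1=1.5` and `b=0.75`, and the toy tokens are assumptions; the real corpus, tokenizer, and parameters are not given in the source.

```python
import math
from collections import Counter
from typing import List


def bm25_scores(query: List[str], docs: List[List[str]],
                k1: float = 1.5, b: float = 0.75) -> List[float]:
    """Score each tokenized document against the query (assumes a non-empty corpus)."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs

    # Document frequency: in how many documents each term appears.
    df: Counter = Counter()
    for d in docs:
        df.update(set(d))

    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            # Smoothed IDF; the "+ 1" inside the log keeps it non-negative.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term-frequency saturation with document-length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


# Hypothetical toy corpus; placeholder tokens only.
docs = [["alpha", "beta", "gamma"], ["delta", "epsilon"], ["alpha", "epsilon", "alpha"]]
scores = bm25_scores(["alpha"], docs)
best = max(range(len(docs)), key=lambda i: scores[i])
```

The highest-scoring passages would then be passed to the model as grounding context; how many are kept and how they are inserted into the prompt is not specified here.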