[Feature]: Keep local-backend prompt prefixes warm to eliminate cold-session prefill

#58192 Investigating P3 comp/agent

[Feature]: Keep local-backend prompt prefixes warm to eliminate cold-session prefill

Local llama.cpp/vLLM-style servers pay a full prefill cost for every new session because the shared prompt prefix is not kept warm; the proposal adds an opt-in gateway watcher that periodically replays a minimal request, and the author notes it is already implemented in PR #57019.