Optimize Ollama Integration: Native /api/chat vs OpenAI-Compatible Endpoint
Feature proposal to replace the OpenAI-compatible endpoint with Ollama's native /api/chat endpoint, claiming true delta streaming, longer timeouts, full native parameter support, and ~15-20% lower latency, with a proposed 581-line adapter implementation.