Log/surface the reason, pin LLM warm for latency, doc insurance rule

- Reason visibility: the reason WAS extracted ("disintegrated eyes") but only lived in the Odoo description note. Add it to the post-call log line and to the Odoo lead title so it's visible at a glance. - Latency: split the timing — Whisper is ~0.1s, latency is LLM-side. The ~3s tail was cold model reloads after Ollama's keep-alive expired. server.py now warms + pins the model on startup (keep_alive=-1, ollama ps UNTIL=Forever), removing cold first-turn stalls. Whisper size left alone (not the bottleneck). - CLAUDE.md: insurance rule (never suggest/guess the plan), latency note. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-06-27 04:24:10 +00:00
parent 8b52097713
commit ba36ae6891
4 changed files with 34 additions and 3 deletions
--- a/odoo_client.py
+++ b/odoo_client.py
@@ -62,7 +62,7 @@ def create_appointment_request(patient_name, callback_number, reason, preferred_
                               insurance=None, call_sid=None):
    """Create the request in Odoo. Returns (model, record_id) or raises OdooError."""
    uid, models = _connect()
-    summary = f"📞 Phone appt request — {patient_name or 'caller'}"
+    summary = f"📞 Phone appt — {patient_name or 'caller'}" + (f": {reason}" if reason else "")
    # description is an Odoo HTML field — build with <br/> so it renders in the UI.
    rows = [
        ("Name", patient_name),