tocmo0nlord/avc-phone-ai

Commit Graph

Author

SHA1

Message

Date

tocmo0nlord

ba36ae6891

Log/surface the reason, pin LLM warm for latency, doc insurance rule

- Reason visibility: the reason WAS extracted ("disintegrated eyes") but only
  lived in the Odoo description note. Add it to the post-call log line and to
  the Odoo lead title so it's visible at a glance.
- Latency: split the timing — Whisper is ~0.1s, latency is LLM-side. The ~3s
  tail was cold model reloads after Ollama's keep-alive expired. server.py now
  warms + pins the model on startup (keep_alive=-1, ollama ps UNTIL=Forever),
  removing cold first-turn stalls. Whisper size left alone (not the bottleneck).
- CLAUDE.md: insurance rule (never suggest/guess the plan), latency note.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

2026-06-27 04:24:10 +00:00
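The warm-and-pin step described in the latency bullet can be sketched roughly as follows. This is a minimal illustration, not the code in server.py: it assumes Ollama is reachable at its default local address, and the helper names `build_warm_request` and `warm_model` are hypothetical. It relies on the documented behaviour of Ollama's `/api/generate` endpoint, where a request without a prompt loads the model and `keep_alive: -1` keeps it resident indefinitely.

```python
import json
import urllib.request

# Assumption: Ollama's default local address; the real deployment may differ.
OLLAMA_URL = "http://localhost:11434"


def build_warm_request(model: str, base_url: str = OLLAMA_URL) -> urllib.request.Request:
    """Build a prompt-less generate request that loads and pins the model.

    Omitting the prompt asks Ollama to load the model only;
    keep_alive=-1 asks it to keep the model in memory indefinitely.
    """
    payload = {"model": model, "keep_alive": -1, "stream": False}
    return urllib.request.Request(
        f"{base_url}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def warm_model(model: str, base_url: str = OLLAMA_URL, timeout: float = 120.0) -> bool:
    """Send the warm-up request; return False instead of raising on failure."""
    try:
        with urllib.request.urlopen(build_warm_request(model, base_url), timeout=timeout) as resp:
            return resp.status == 200
    except OSError:  # covers URLError, HTTPError, refused connections, timeouts
        return False
```

Calling something like `warm_model("activeblue-avc:latest")` once at startup would move the cold load off the caller's first turn. Whether the model is pinned can then be checked with `ollama ps`, as the commit message notes.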

tocmo0nlord

5ed641255c

Revert Phase 1 STT/auth swaps: stay on Whisper + Twilio Auth Token

Deepgram and the Twilio Standard API Key were reverted per decision:
- bot.py: restore HintedWhisperSTTService (faster-whisper hotwords), default
  model medium; remove DeepgramSTTService import + DEEPGRAM_API_KEY.
- server.py: restore TWILIO_AUTH_TOKEN for X-Twilio-Signature validation and
  the serializer auto-hang-up. Twilio signs webhooks with the Auth Token, so
  an API Key Secret cannot validate signatures.
- .env.example: back to TWILIO_AUTH_TOKEN + Whisper STT vars.
- .gitignore: ignore runtime *.log (avc_run.log).

OLLAMA_MODEL stays activeblue-avc:latest (the existing pulled tag).

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

2026-06-25 01:06:24 +00:00
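The reason an API Key Secret cannot stand in for the Auth Token is visible in the signature scheme itself. The sketch below follows Twilio's documented algorithm for form-encoded webhooks using only the standard library. It is an illustration under that assumption, not the validation code in server.py (which may well use Twilio's own helper library), and the function names are hypothetical.

```python
import base64
import hashlib
import hmac


def compute_twilio_signature(auth_token: str, url: str, params: dict[str, str]) -> str:
    """Compute the value Twilio would send in X-Twilio-Signature.

    The signed string is the full request URL followed by each POST
    parameter's name and value, concatenated in name-sorted order.
    It is signed with HMAC-SHA1 keyed by the account's Auth Token,
    then base64-encoded.
    """
    payload = url + "".join(key + params[key] for key in sorted(params))
    digest = hmac.new(
        auth_token.encode("utf-8"), payload.encode("utf-8"), hashlib.sha1
    ).digest()
    return base64.b64encode(digest).decode("ascii")


def is_valid_twilio_request(
    auth_token: str, url: str, params: dict[str, str], header_signature: str
) -> bool:
    """Compare the expected signature with the header in constant time."""
    expected = compute_twilio_signature(auth_token, url, params)
    return hmac.compare_digest(expected, header_signature)
```

Because the HMAC key is the Auth Token, recomputing the signature with any other secret gives a different digest, so every genuine webhook would be rejected.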

tocmo0nlord

c3c719b77e

Initial commit: avc-phone-ai codebase + CLAUDE.md

2026-06-23 22:38:22 +00:00

3 Commits