fix: increase embedding and LLM timeouts for large local models

qwen3-embedding:8b needs >60s on first load. Set EMBEDDING_TIMEOUT=300
(worker executes at 2x = 600s) and TIMEOUT=600 for LLM calls.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
2026-04-19 13:52:25 +08:00
parent 3a7249375c
commit 2ffdfd2976
+4
View File
@@ -12,6 +12,10 @@ EMBEDDING_DIM=4096
# Storage (local files)
RAG_DIR=./rag_storage
# Timeouts (in seconds) — increase for large local models
EMBEDDING_TIMEOUT=300
TIMEOUT=600
# Server
HOST=0.0.0.0
PORT=9621