fix: increase embedding and LLM timeouts for large local models

qwen3-embedding:8b needs >60s on first load. Set EMBEDDING_TIMEOUT=300
(the worker executes at 2x that value, i.e. 600s) and TIMEOUT=600 for LLM calls.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-04-19 13:52:25 +08:00
parent 3a7249375c
commit 2ffdfd2976
1 changed file with 4 additions and 0 deletions
@@ -12,6 +12,10 @@ EMBEDDING_DIM=4096
 # Storage (local files)
 RAG_DIR=./rag_storage
 # Timeouts (in seconds) — increase for large local models
 EMBEDDING_TIMEOUT=300
 TIMEOUT=600
 # Server
 HOST=0.0.0.0
 PORT=9621
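
To make the arithmetic in the commit message concrete, here is a minimal sketch of how the two settings might be consumed. It assumes the values are read from environment variables and that the worker simply doubles the embedding timeout, as the message states. The function name `read_timeouts` and the fallback defaults are hypothetical and do not come from the project.

```python
import os


def read_timeouts(env=None):
    """Return (embedding_timeout, worker_timeout, llm_timeout) in seconds.

    Hypothetical helper: the fallback defaults mirror the values set in
    this commit and are an assumption, not the project's real defaults.
    """
    env = os.environ if env is None else env
    embedding_timeout = int(env.get("EMBEDDING_TIMEOUT", "300"))
    llm_timeout = int(env.get("TIMEOUT", "600"))
    # Per the commit message, the worker runs at 2x the embedding timeout.
    worker_timeout = 2 * embedding_timeout
    return embedding_timeout, worker_timeout, llm_timeout


if __name__ == "__main__":
    # Values from this commit: the worker limit works out to 600 seconds.
    print(read_timeouts({"EMBEDDING_TIMEOUT": "300", "TIMEOUT": "600"}))
```

With the values from this commit, the embedding worker limit and the LLM call limit both come out at 600 seconds, which leaves room for the >60s first load of a large local model.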