fix: increase embedding and LLM timeouts for large local models
qwen3-embedding:8b needs >60s on first load. Set EMBEDDING_TIMEOUT=300 (worker executes at 2x = 600s) and TIMEOUT=600 for LLM calls. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
@@ -12,6 +12,10 @@ EMBEDDING_DIM=4096
|
|||||||
# Storage (local files)
|
# Storage (local files)
|
||||||
RAG_DIR=./rag_storage
|
RAG_DIR=./rag_storage
|
||||||
|
|
||||||
|
# Timeouts (in seconds) — increase for large local models
|
||||||
|
EMBEDDING_TIMEOUT=300
|
||||||
|
TIMEOUT=600
|
||||||
|
|
||||||
# Server
|
# Server
|
||||||
HOST=0.0.0.0
|
HOST=0.0.0.0
|
||||||
PORT=9621
|
PORT=9621
|
||||||
|
|||||||
Reference in New Issue
Block a user