- Add AGENTS.md with repo guidelines
- Add lightrag-mcp: FastMCP server exposing insert_documents() + query_documents()
to LLM agents via stdio transport, talks to LightRAG REST API
- Add scripts/patch-vllm-cpu.py for CPU inference patching
- Add .env.vllm for vLLM configuration
- Update flake.nix with expanded dev shell
- Update .env.lightrag
- Remove CLAUDE.md (replaced by AGENTS.md)
qwen3-embedding:8b needs >60s on first load. Set EMBEDDING_TIMEOUT=300
(worker executes at 2x = 600s) and TIMEOUT=600 for LLM calls.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Impure devShells using uv + Python 3.12 for each project.
LD_LIBRARY_PATH set for NixOS native wheel compatibility.
Neo4j managed via nix run .#neo4j-start / neo4j-stop.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>