vLLM
Personalized memory and session summaries with vLLM.
Prerequisites: 1.
Prerequisites: 1. Start a Postgres + pgvector container (helper script is provided): ./cookbook/scripts/run_pgvector.sh 2. Install dependencies: uv pip install sqlalchemy ‘psycopg[binary]’ pgvector 3. Run a vLLM server (any open model). Example with Phi-3: vllm serve microsoft/Phi-3-mini-128k-instruct \ —dtype float32 \ —enable-auto-tool-choice \ —tool-call-parser pythonic Then execute this script – it will remember facts you tell it and generate a summary.