Benchmarking AI persistent memory server against connected memory.

r/LocalLLaMA
Generative AI

Retrieval of only semantically similar memories using vector search is not sufficient to build an holistic context to feed to an llm. Most of the memory system works on the first concept of pure vector search, While running an experiment I found out that purely working on semantics would never give me a complete picture, the hybrid approach of semantic search + entity graph is the way forward.