MemU Achieves 92.09% Accuracy on the LoCoMo Benchmark
MemU achieves 92.09% average accuracy on the LoCoMo dataset across all reasoning tasks, significantly outperforming competitors.

About LoCoMo (Long Conversation Memory)
The LoCoMo (Long Conversation Memory) dataset is the industry‑standard benchmark for evaluating long‑term memory and reasoning in conversational AI systems. It is built from very long multi‑session dialogues with rich temporal, personal, and event‑driven context, enabling comprehensive evaluation across multiple reasoning categories such as single‑hop retrieval, multi‑hop inference, temporal understanding, and open‑domain question answering. Originally developed through a hybrid human–machine annotation process and adopted widely by the research community, LoCoMo provides a standardized framework to assess an AI’s ability to recall, reason, and persist information across extended interactions.
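To make the idea of accuracy across reasoning categories concrete, here is a minimal sketch of how per-category and overall accuracy could be computed from graded answers. The function names and the toy records are hypothetical and are not MemU or LoCoMo results. The source does not say whether the headline figure is pooled over all questions or averaged per category, so the sketch shows the pooled version and reports the per-category breakdown separately.

```python
from collections import defaultdict


def accuracy_by_category(results):
    """Return {category: accuracy} for (category, is_correct) pairs."""
    totals = defaultdict(int)
    correct = defaultdict(int)
    for category, is_correct in results:
        totals[category] += 1
        correct[category] += int(is_correct)
    return {c: correct[c] / totals[c] for c in totals}


def overall_accuracy(results):
    """Return accuracy pooled over every question, regardless of category."""
    results = list(results)
    return sum(int(ok) for _, ok in results) / len(results)


# Hypothetical graded answers, for illustration only (not benchmark data).
toy_results = [
    ("single-hop", True), ("single-hop", True),
    ("multi-hop", True), ("multi-hop", False),
    ("temporal", True),
    ("open-domain", False),
]

per_category = accuracy_by_category(toy_results)
overall = overall_accuracy(toy_results)

for name, acc in per_category.items():
    print(f"{name}: {acc:.2%}")
print(f"overall: {overall:.2%}")
```

Because the categories usually contain different numbers of questions, the pooled figure and a plain mean of the per-category figures can differ, so it is worth checking which one a reported score uses.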
Try MemU Now
Experience MemU’s long-term memory capabilities instantly on our cloud platform. Integrate easily with your AI products and see how persistent, self-evolving memories can make your AI more intelligent and responsive.
Startup-Friendly Free Plan
Get started quickly with MemU’s free plan for startups. Perfect for small teams building memory-driven AI applications or experimenting with AI products that benefit from long-term memory. Enjoy the full power of MemU’s memory capabilities without upfront costs.
- Free access to the Response API and Memory API, subject to usage limits.
- Easy onboarding for teams and fast integration into your AI projects.
- Ideal for experimenting, prototyping, and validating long-term memory in your AI applications.
Start Building Smarter AI Today
Explore MemU and see how long-term memory can enhance your AI projects. Begin creating AI that remembers, learns, and adapts.