A library for benchmarking the long-term memory and continual learning capabilities of LLM-based agents, with all the tests and code you need to evaluate your own agents. See more in the blogpost:
☆86Dec 17, 2024Updated last year
Alternatives and similar repositories for goodai-ltm-benchmark
Users that are interested in goodai-ltm-benchmark are comparing it to the libraries listed below.
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆31Jun 27, 2024Updated last year
- ☆11Nov 20, 2020Updated 5 years ago
- LLM-Powered Data Discovery System for Tabular Data☆30Apr 7, 2026Updated last month
- Measuring RAG solutions throughput and latency☆20Jul 23, 2024Updated last year
- Intelligent file organization with computer vision, audio analysis, chunking, proactive AI-powered analysis, interactive classification, …☆34Updated this week
- ☆89Dec 15, 2023Updated 2 years ago
- ☆19Aug 7, 2024Updated last year
- A simple Docker Compose boilerplate for deploying Open WebUI and LiteLLM with Traefik for personal LLM use. Securely manage and access la…☆21Jun 3, 2025Updated 11 months ago
- Track the progress of LLM context utilisation☆55Apr 14, 2025Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzu☆73Sep 17, 2025Updated 8 months ago
- code for training and using chess embeddings models☆13Jun 9, 2024Updated last year
- Visualization of different agile methodologies☆19Apr 25, 2015Updated 11 years ago
- call your javascript class without new because this is the classy way ;)☆10May 19, 2017Updated 9 years ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆33Nov 4, 2024Updated last year
- ☆19Aug 23, 2025Updated 9 months ago
- Superposition Yields Robust Neural Scaling☆65Feb 12, 2026Updated 3 months ago
- ☆14May 9, 2024Updated 2 years ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- AI assisted article writing project☆13Jan 31, 2025Updated last year
- Graphlit Platform☆32Feb 20, 2024Updated 2 years ago
- Recursive Self-Aggregation evals on ARC-AGI☆36Jan 26, 2026Updated 3 months ago
- POC integration Airbyte+Dagster+Langchain☆13Jun 1, 2023Updated 2 years ago
- Cortex: Advanced Memory System for AI Agents☆99Feb 10, 2026Updated 3 months ago
- Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"☆32Jun 25, 2025Updated 11 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 8 months ago
- ☆11May 2, 2022Updated 4 years ago
- See a video of driving between two locations atop the Google Street View Car☆26Mar 31, 2013Updated 13 years ago
- ☆15Dec 12, 2025Updated 5 months ago
- Salesforce AI Research's open diffusion language model☆63Oct 29, 2025Updated 6 months ago
- Your missing guide how to operate with NSTextList and write bullet/numbered lists like Notes app.☆15Oct 16, 2024Updated last year
- Code repo for CLERC: A Legal Precedent Dataset for Case Retrieval and Retrieval-Augmented Analysis Generation (NAACL 2025)☆29Jan 28, 2025Updated last year
- ☆15Jul 9, 2025Updated 10 months ago
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech☆28Sep 14, 2025Updated 8 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Jan 23, 2025Updated last year
- Advertise your OSC app through OSCQuery.☆12Mar 15, 2025Updated last year
- The official repo for SocKET: Social Knowledge Evaluation Tests☆24May 12, 2025Updated last year
- AgentIR is a retriever specialized for Deep Research agents.☆55Apr 16, 2026Updated last month
- ☆67Jun 27, 2025Updated 10 months ago