A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you need to evaluate your own agents. See more in the blogpost:
☆87Dec 17, 2024Updated last year
Alternatives and similar repositories for goodai-ltm-benchmark
Users that are interested in goodai-ltm-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LMQL implementation of tree of thoughts☆36Jan 31, 2024Updated 2 years ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆31Jun 27, 2024Updated 2 years ago
- Benchmarking LLM Inference Speeds☆14May 17, 2026Updated last month
- ☆10Nov 6, 2024Updated last year
- ☆89Dec 15, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A prototype agent with the purpose of evaluating the performance of a Large Language Model within a python terminal.☆13Aug 28, 2023Updated 2 years ago
- Training GPTs to solve interaction nets☆18Aug 14, 2024Updated last year
- ☆19Aug 7, 2024Updated last year
- Track the progress of LLM context utilisation☆56Apr 14, 2025Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzu☆73Sep 17, 2025Updated 9 months ago
- ☆13Jul 16, 2024Updated last year
- code for training and using chess embeddings models☆14Jun 9, 2024Updated 2 years ago
- Visualization of different agile methodologies☆19Apr 25, 2015Updated 11 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- call your javascript class without new because this is the classy way ;)☆10May 19, 2017Updated 9 years ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆32Nov 4, 2024Updated last year
- ☆19Aug 23, 2025Updated 10 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- AI assisted article writing project☆13Jan 31, 2025Updated last year
- reusable frontend components for taskforce projects☆11May 8, 2023Updated 3 years ago
- POC integration Airbyte+Dagster+Langchain☆13Jun 1, 2023Updated 3 years ago
- Cortex: Advanced Memory System for AI Agents☆100Feb 10, 2026Updated 4 months ago
- Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation☆26Feb 18, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"☆32Jun 25, 2025Updated last year
- ☆11May 2, 2022Updated 4 years ago
- ☆15Dec 12, 2025Updated 6 months ago
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'☆27May 16, 2025Updated last year
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech☆28Sep 14, 2025Updated 9 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Jan 23, 2025Updated last year
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated last year
- ☆14Apr 25, 2025Updated last year
- The official repo for SocKET: Social Knowledge Evaluation Tests☆24May 12, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆68Jun 27, 2025Updated last year
- blablado is an extensible Assistant that listens to your voice and can execute custom Python functions you provided. It can speak as well…☆69Aug 4, 2024Updated last year
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 8 months ago
- Bayesian scaling laws for in-context learning.☆16Mar 12, 2025Updated last year
- ☆13Mar 18, 2024Updated 2 years ago
- A full-stack document management and AI chat application that enables users to upload, manage, and chat with their documents using AI. Bu…☆16Aug 10, 2025Updated 10 months ago
- ComfyUI custom node to extend Wan videos in loops with overlap consistency, per loop prompts, and optional LoRA control.☆32Nov 29, 2025Updated 7 months ago