A library for benchmarking the long-term memory and continual learning capabilities of LLM-based agents, with all the tests and code you need to evaluate your own agents. See more in the blogpost:
☆86Dec 17, 2024Updated last year
Alternatives and similar repositories for goodai-ltm-benchmark
Users that are interested in goodai-ltm-benchmark are comparing it to the libraries listed below.
- LMQL implementation of tree of thoughts☆36Jan 31, 2024Updated 2 years ago
- Dialogue Action Tokens: Steering Language Models in Goal-Directed Dialogue with a Multi-Turn Planner☆31Jun 27, 2024Updated last year
- Benchmarking LLM Inference Speeds☆13Apr 7, 2026Updated 3 weeks ago
- ☆10Nov 6, 2024Updated last year
- LLM-Powered Data Discovery System for Tabular Data☆28Apr 7, 2026Updated 3 weeks ago
- Measuring RAG solutions throughput and latency☆20Jul 23, 2024Updated last year
- ☆88Dec 15, 2023Updated 2 years ago
- An embedding-less RAG pipeline (Use with simple pip install eliteRAG)☆38Nov 28, 2025Updated 5 months ago
- Training GPTs to solve interaction nets☆18Aug 14, 2024Updated last year
- ☆19Aug 7, 2024Updated last year
- Track the progress of LLM context utilisation☆55Apr 14, 2025Updated last year
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzu☆73Sep 17, 2025Updated 7 months ago
- ☆12Jul 16, 2024Updated last year
- Code for training and using chess embedding models☆13Jun 9, 2024Updated last year
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆33Nov 4, 2024Updated last year
- ☆19Aug 23, 2025Updated 8 months ago
- ☆16Dec 9, 2023Updated 2 years ago
- ☆14May 9, 2024Updated last year
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- AI assisted article writing project☆13Jan 31, 2025Updated last year
- Recursive Self-Aggregation evals on ARC-AGI☆33Jan 26, 2026Updated 3 months ago
- POC integration Airbyte+Dagster+Langchain☆13Jun 1, 2023Updated 2 years ago
- A Free-Software JavaScript Library made by people for the people!☆10Aug 1, 2020Updated 5 years ago
- Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"☆32Jun 25, 2025Updated 10 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 7 months ago
- ☆11May 2, 2022Updated 4 years ago
- ☆14Aug 15, 2024Updated last year
- ☆15Dec 12, 2025Updated 4 months ago
- ☆15Jul 9, 2025Updated 9 months ago
- ☆20Nov 1, 2024Updated last year
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'☆27May 16, 2025Updated 11 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Jan 23, 2025Updated last year
- Hackathon project for Snarky workshop.☆11Jun 21, 2019Updated 6 years ago
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated last year
- blablado is an extensible Assistant that listens to your voice and can execute custom Python functions you provided. It can speak as well…☆69Aug 4, 2024Updated last year
- A trivial programmatic Llama 3 jailbreak. Sorry Zuck!☆567Jan 26, 2025Updated last year
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 6 months ago
- A collection of interesting links, articles, research papers and projects related to knowledge graphs, GenAI and LLMs (large language mod…☆28Jul 5, 2024Updated last year
- RWKV-7 mini☆12Mar 29, 2025Updated last year