A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you need to evaluate your own agents. See more in the blogpost:
☆84Dec 17, 2024Updated last year
Alternatives and similar repositories for goodai-ltm-benchmark
Users that are interested in goodai-ltm-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Nov 6, 2024Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- LLM-Powered Data Discovery System for Tabular Data☆24Jul 14, 2025Updated 8 months ago
- A unit test framework for prompts.☆11Feb 9, 2023Updated 3 years ago
- Measuring RAG solutions throughput and latency☆20Jul 23, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Intelligent file organization with computer vision, audio analysis, chunking, proactive AI-powered analysis, interactive classification, …☆30Updated this week
- An embeding-less RAG pipeline (Use with simple pip install eliteRAG)☆37Nov 28, 2025Updated 3 months ago
- A prototype agent with the purpose of evaluating the performance of a Large Language Model within a python terminal.☆13Aug 28, 2023Updated 2 years ago
- Text-To-3D dream-fusion☆13Apr 10, 2023Updated 2 years ago
- Training GPTs to solve interaction nets☆18Aug 14, 2024Updated last year
- A simple Docker Compose boilerplate for deploying Open WebUI and LiteLLM with Traefik for personal LLM use. Securely manage and access la…☆20Jun 3, 2025Updated 9 months ago
- ☆19Aug 7, 2024Updated last year
- Track the progress of LLM context utilisation☆55Apr 14, 2025Updated 11 months ago
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzu☆73Sep 17, 2025Updated 6 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- code for training and using chess embeddings models☆13Jun 9, 2024Updated last year
- ☆16Dec 9, 2023Updated 2 years ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆32Nov 4, 2024Updated last year
- ☆19Aug 23, 2025Updated 7 months ago
- Superposition Yields Robust Neural Scaling☆63Feb 12, 2026Updated last month
- ☆17Jan 3, 2025Updated last year
- ☆14Nov 12, 2024Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A simple editing tool for cutting, compressing, and processing visuals.☆22Mar 16, 2025Updated last year
- AI assisted article writing project☆13Jan 31, 2025Updated last year
- a deep learning framework for essential protein prediction☆13Mar 24, 2023Updated 3 years ago
- Graphlit Platform☆31Feb 20, 2024Updated 2 years ago
- POC integration Airbyte+Dagster+Langchain☆13Jun 1, 2023Updated 2 years ago
- ComfyUI custom node to extend Wan videos in loops with overlap consistency, per loop prompts, and optional LoRA control.☆25Nov 29, 2025Updated 3 months ago
- Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation☆25Feb 18, 2025Updated last year
- ☆25Jan 1, 2025Updated last year
- Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"☆32Jun 25, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code repo for CLERC: A Legal Precedent Dataset for Case Retrieval and Retrieval-Augmented Analysis Generation (NAACL 2025)☆25Jan 28, 2025Updated last year
- ☆11May 2, 2022Updated 3 years ago
- Unison syntax highlighting for VS code☆10Jul 13, 2022Updated 3 years ago
- Your missing guide how to operate with NSTextList and write bullet/numbered lists like Notes app.☆15Oct 16, 2024Updated last year
- ☆10May 6, 2024Updated last year
- ☆24Feb 26, 2026Updated 3 weeks ago
- ☆15Jul 9, 2025Updated 8 months ago