chroma-core / context-rotLinks
This repository contains the toolkit for replicating results from our technical report.
☆138Updated last month
Alternatives and similar repositories for context-rot
Users that are interested in context-rot are comparing it to the libraries listed below
Sorting:
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆254Updated last month
- ☆232Updated 3 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆456Updated last month
- A Text-Based Environment for Interactive Debugging☆272Updated this week
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆475Updated 2 months ago
- MCP-Universe is a comprehensive framework designed for developing, testing, and benchmarking AI agents☆445Updated this week
- Public repository containing METR's DVC pipeline for eval data analysis☆117Updated 6 months ago
- Ranking LLMs on agentic tasks☆192Updated last month
- ☆100Updated last year
- ☆136Updated this week
- ☆449Updated 3 months ago
- ☆78Updated last week
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆187Updated last month
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆441Updated last month
- MCP-based Agent Deep Evaluation System☆135Updated last week
- Beating the GAIA benchmark with Transformers Agents. 🚀☆136Updated 7 months ago
- An agent benchmark with tasks in a simulated software company.☆556Updated 3 weeks ago
- An Automatic Prompt Optimization Framework for Large Language Models☆122Updated 2 months ago
- ☆280Updated 2 months ago
- Official Repo for CRMArena and CRMArena-Pro☆118Updated 3 months ago
- A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating System…☆135Updated 5 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆269Updated 2 months ago
- Code and data for the Chain-of-Draft (CoD) paper☆331Updated 6 months ago
- Agentic Web: Weaving the Next Web with AI Agents.☆369Updated last week
- Tutorial for building LLM router☆228Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆102Updated 5 months ago
- An open-source tool for LLM prompt optimization.☆642Updated last week
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.☆130Updated this week
- Verifiers for LLM Reinforcement Learning☆75Updated 3 weeks ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆343Updated 3 months ago