JoshuaPurtell / LRCBench
Evals meant to evaluate language models' ability to reason over long contexts.
☆10Updated 10 months ago
Alternatives and similar repositories for LRCBench
Users who are interested in LRCBench are comparing it to the libraries listed below.
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆82Updated 4 months ago
- ☆64Updated last month
- An automated tool for discovering insights from research paper corpora☆138Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure☆49Updated 9 months ago
- look how they massacred my boy☆63Updated 9 months ago
- Plotting (entropy, varentropy) for small LMs☆97Updated last month
- Public repository containing METR's DVC pipeline for eval data analysis☆78Updated 3 months ago
- Train your own SOTA deductive reasoning model☆99Updated 4 months ago
- ☆55Updated this week
- ☆101Updated last month
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 5 months ago
- Source code and utilities for the Genesys distributed language model architecture discovery system.☆41Updated 2 weeks ago
- ☆66Updated last year
- tiny_fnc_engine is a minimal Python library that provides a flexible engine for calling functions extracted from an LLM.☆38Updated 10 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 8 months ago
- smolLM with Entropix sampler on pytorch☆150Updated 8 months ago
- Tools to make language models a bit easier to use☆48Updated 2 weeks ago
- auto fine tune of models with synthetic data☆76Updated last year
- A toolkit for building computer use AI agents☆168Updated 3 weeks ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 4 months ago
- A framework for orchestrating AI agents using a mermaid graph☆77Updated last year
- autologic is a Python package that implements the SELF-DISCOVER framework proposed in the paper SELF-DISCOVER: Large Language Models Self…☆60Updated last year
- Small, simple agent task environments for training and evaluation☆18Updated 8 months ago
- Simple Graph Memory for AI applications☆88Updated 2 months ago
- RAG example using DSPy, Gradio, FastAPI☆83Updated last year
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 6 months ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- ☆75Updated 7 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 10 months ago
- A user interface for DSPy☆162Updated last month