JoshuaPurtell / LRCBench
Evals meant to evaluate language models' ability to reason over long contexts.
☆9Updated 8 months ago
Alternatives and similar repositories for LRCBench
Users that are interested in LRCBench are comparing it to the libraries listed below
Sorting:
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆78Updated 2 months ago
- Verbosity control for AI agents☆63Updated 11 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆47Updated 7 months ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Updated 7 months ago
- 🦾💻🌐 distributed training & serverless inference at scale on RunPod☆17Updated 11 months ago
- Chat Markup Language conversation library☆55Updated last year
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated 8 months ago
- ☆20Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 2 months ago
- look how they massacred my boy☆63Updated 7 months ago
- ☆72Updated last week
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆50Updated 6 months ago
- ☆38Updated 9 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 3 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 6 months ago
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 4 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 6 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆38Updated 3 months ago
- ☆48Updated 6 months ago
- Lego for GRPO☆28Updated last month
- ☆45Updated 8 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆51Updated 7 months ago
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc…☆10Updated last month
- Very minimal (and stateless) agent framework☆44Updated 4 months ago
- Train your own SOTA deductive reasoning model☆92Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 3 months ago
- Interactive timeline of AI history☆51Updated last month
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆37Updated last week
- alternative way to calculating self attention☆18Updated 11 months ago