Ziems / arborLinks
A framework for optimizing DSPy programs with RL
☆182Updated last week
Alternatives and similar repositories for arbor
Users that are interested in arbor are comparing it to the libraries listed below
Sorting:
- Inference-time scaling for LLMs-as-a-judge.☆299Updated last month
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆301Updated 2 weeks ago
- ☆68Updated 4 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆89Updated last year
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆98Updated 2 months ago
- Training-Ready RL Environments + Evals☆111Updated this week
- ⚖️ Awesome LLM Judges ⚖️☆128Updated 5 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆329Updated 3 weeks ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆55Updated 4 months ago
- ☆157Updated 9 months ago
- ☆133Updated 6 months ago
- Train your own SOTA deductive reasoning model☆107Updated 6 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆52Updated last year
- ☆76Updated 3 weeks ago
- Simple UI for debugging correlations of text embeddings☆291Updated 4 months ago
- Storing long contexts in tiny caches with self-study☆192Updated 2 weeks ago
- Routing on Random Forest (RoRF)☆211Updated last year
- Sphynx Hallucination Induction☆53Updated 8 months ago
- Claude Deep Research config for Claude Code.☆219Updated 6 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆52Updated 4 months ago
- OSS RL environment + evals toolkit☆181Updated this week
- ☆70Updated 3 weeks ago
- A small library of LLM judges☆287Updated 2 months ago
- ☆28Updated 3 months ago
- A user interface for DSPy☆181Updated 4 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆450Updated last year
- A strongly typed Python DSL for developing message passing multi agent systems☆53Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 7 months ago
- Minimal agent runtime built with DSPy modules and a thin Python loop. Includes CLI, FastAPI server, and eval harness with OpenAI/Ollama s…☆58Updated 3 weeks ago
- look how they massacred my boy☆64Updated 11 months ago