VictorTaelin / ab_challenge_eval
Evaluator for the A::B Prompting Challenge
☆27 Updated last year
Alternatives and similar repositories for ab_challenge_eval
Users who are interested in ab_challenge_eval are comparing it to the repositories listed below.
- explore token trajectory trees on instruct and base models ☆134 Updated last month
- Grounding LLM mathematical reasoning with proof assistants. ☆63 Updated 2 years ago
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches. ☆266 Updated 8 months ago
- LLM verified with Monte Carlo Tree Search ☆276 Updated 3 months ago
- This repository explains and provides examples for "concept anchoring" in GPT4. ☆72 Updated last year
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management ☆69 Updated 6 months ago
- Certified Reasoning with Language Models ☆31 Updated last year
- look how they massacred my boy ☆63 Updated 9 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆58 Updated last year
- a curated list of data for reasoning ai ☆136 Updated 11 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX ☆49 Updated 8 months ago
- ☆122 Updated 11 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you… ☆74 Updated 7 months ago
- LLMs + Lean, on your laptop or in the cloud ☆169 Updated last month
- Reproduction Package for the paper "Type-Constrained Code Generation with Language Models" [PLDI 2025] ☆63 Updated last month
- ☆96 Updated 7 months ago
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers ☆76 Updated 11 months ago
- Training GPTs to solve interaction nets ☆17 Updated 11 months ago
- Formalizing stochastic doubly-efficient debate ☆107 Updated 9 months ago
- ☆23 Updated 7 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆119 Updated last year
- RASP-L in Haskell for my fellow rascals ☆19 Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆140 Updated 5 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations. ☆70 Updated 5 months ago
- Code for the Fractured Entangled Representation Hypothesis position paper! ☆135 Updated 2 months ago
- ☆87 Updated 6 months ago
- Exponent pair database ☆59 Updated this week
- a categorical deep learning compiler ☆203 Updated 4 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆32 Updated last year
- Fast parallel LLM inference for MLX ☆198 Updated last year