VictorTaelin / ab_challenge_evalLinks
Evaluator for the A::B Prompting Challenge
☆27Updated last year
Alternatives and similar repositories for ab_challenge_eval
Users that are interested in ab_challenge_eval are comparing it to the libraries listed below
Sorting:
- explore token trajectory trees on instruct and base models☆132Updated 3 months ago
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.☆271Updated 10 months ago
- Grounding LLM mathematical reasoning with proof assistants.☆63Updated 2 years ago
- This repository explains and provides examples for "concept anchoring" in GPT4.☆71Updated last year
- LLM verified with Monte Carlo Tree Search☆281Updated 5 months ago
- look how they massacred my boy☆64Updated 10 months ago
- Fast parallel LLM inference for MLX☆212Updated last year
- a curated list of data for reasoning ai☆137Updated last year
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆180Updated last month
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- ☆90Updated 7 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆49Updated 9 months ago
- smol models are fun too☆93Updated 9 months ago
- Training GPTs to solve interaction nets☆17Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 6 months ago
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers☆76Updated last year
- LLMs + Lean, on your laptop or in the cloud☆176Updated last month
- ☆99Updated 9 months ago
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management☆70Updated 7 months ago
- Code for the Fractured Entangled Representation Hypothesis position paper!☆188Updated 3 months ago
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆287Updated 3 weeks ago
- Harmonic Datasets☆46Updated last year
- Simple Transformer in Jax☆140Updated last year
- ☆123Updated last year
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆76Updated 8 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆73Updated 6 months ago
- Formalizing stochastic doubly-efficient debate☆108Updated 10 months ago
- Certified Reasoning with Language Models☆31Updated last year
- Editor with LLM generation tree exploration☆74Updated 6 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆146Updated 6 months ago