VictorTaelin / ab_challenge_evalLinks
Evaluator for the A::B Prompting Challenge
☆27Updated last year
Alternatives and similar repositories for ab_challenge_eval
Users that are interested in ab_challenge_eval are comparing it to the libraries listed below
Sorting:
- Grounding LLM mathematical reasoning with proof assistants.☆63Updated 2 years ago
- This repository explains and provides examples for "concept anchoring" in GPT4.☆72Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- explore token trajectory trees on instruct and base models☆127Updated last month
- Training GPTs to solve interaction nets☆17Updated 10 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆50Updated 7 months ago
- look how they massacred my boy☆63Updated 8 months ago
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.☆261Updated 7 months ago
- a curated list of data for reasoning ai☆136Updated 10 months ago
- Formalizing stochastic doubly-efficient debate☆107Updated 8 months ago
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆31Updated 3 months ago
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers☆76Updated 11 months ago
- Certified Reasoning with Language Models☆31Updated last year
- Approximating the joint distribution of language models via MCTS☆21Updated 7 months ago
- ☆86Updated 5 months ago
- ☆115Updated 6 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆70Updated 4 months ago
- this is a TypeScript-based MCP server that implements a simple loom and makes it available for Claude to use.☆20Updated 6 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 4 months ago
- ☆38Updated 11 months ago
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management☆64Updated 5 months ago
- ☆27Updated 9 months ago
- A graph visualization of attention☆56Updated last month
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆73Updated 6 months ago
- A Python library for automatically solving Abstraction and Reasoning Corpus (ARC) challenges using Claude and object-centric modeling.☆22Updated 5 months ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated 2 years ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 7 months ago
- It's a baby compiler. (Lean btw.)☆16Updated last month
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆175Updated last month
- LLM verified with Monte Carlo Tree Search☆276Updated 2 months ago