VictorTaelin / ab_challenge_eval

Evaluator for the A::B Prompting Challenge

☆27

Alternatives and similar repositories for ab_challenge_eval

Users who are interested in ab_challenge_eval are comparing it to the libraries listed below.

atroyn / math-llm
Grounding LLM mathematical reasoning with proof assistants.
☆63 Updated 2 years ago
kenshin9000 / ConceptARC-Representations
This repository explains and provides examples for "concept anchoring" in GPT4.
☆72 Updated last year
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58 Updated last year
vgel / logitloom
explore token trajectory trees on instruct and base models
☆127 Updated last month
reissbaker / clevergpt
Training GPTs to solve interaction nets
☆17 Updated 10 months ago
samefarrar / entropix_mlx
Modify Entropy Based Sampling to work with Mac Silicon via MLX
☆50 Updated 7 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63 Updated 8 months ago
neurallambda / neurallambda
Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.
☆261 Updated 7 months ago
neurallambda / awesome-reasoning
a curated list of data for reasoning ai
☆136 Updated 10 months ago
google-deepmind / debate
Formalizing stochastic doubly-efficient debate
☆107 Updated 8 months ago
lechmazur / divergent
LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…
☆31 Updated 3 months ago
jxmorris12 / gptzip
Losslessly encode text natively with arithmetic coding and HuggingFace Transformers
☆76 Updated 11 months ago
gpoesia / certified-reasoning
Certified Reasoning with Language Models
☆31 Updated last year
doomslide / autoloom
Approximating the joint distribution of language models via MCTS
☆21 Updated 7 months ago
joshuacnf / Ctrl-G
☆86 Updated 5 months ago
teknium1 / ShareGPT-Builder
☆115 Updated 6 months ago
N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.
☆70 Updated 4 months ago
maxsloef / loom-mcp
A TypeScript-based MCP server that implements a simple loom and makes it available for Claude to use.
☆20 Updated 6 months ago
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32 Updated 4 months ago
xjdr-alt / muzero_sketch
☆38 Updated 11 months ago
facebookresearch / LeanUniverse
LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management
☆64 Updated 5 months ago
westoncb / latent-langs
☆27 Updated 9 months ago
doomslide / attention-graph
A graph visualization of attention
☆56 Updated last month
GoodAI / goodai-ltm-benchmark
A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…
☆73 Updated 6 months ago
agemoai / arcsolver
A Python library for automatically solving Abstraction and Reasoning Corpus (ARC) challenges using Claude and object-centric modeling.
☆22 Updated 5 months ago
teknium1 / RawTransform
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆30 Updated 2 years ago
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆64 Updated 7 months ago
doomslide / baby-compiler
It's a baby compiler. (Lean btw.)
☆16 Updated last month
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆175 Updated last month
namin / llm-verified-with-monte-carlo-tree-search
LLM verified with Monte Carlo Tree Search
☆276 Updated 2 months ago