VictorTaelin / ab_challenge_eval
Evaluator for the A::B Prompting Challenge
☆27Updated 11 months ago
Alternatives and similar repositories for ab_challenge_eval:
Users that are interested in ab_challenge_eval are comparing it to the libraries listed below
- look how they massacred my boy☆63Updated 5 months ago
- Grounding LLM mathematical reasoning with proof assistants.☆63Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated 10 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆50Updated 4 months ago
- Certified Reasoning with Language Models☆31Updated last year
- ☆20Updated 4 months ago
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.☆248Updated 4 months ago
- This repository explains and provides examples for "concept anchoring" in GPT4.☆72Updated last year
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management☆59Updated 2 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆31Updated last month
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆68Updated last month
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆63Updated 4 months ago
- ☆97Updated 5 months ago
- ☆27Updated 6 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆25Updated 9 months ago
- smol models are fun too☆90Updated 4 months ago
- Like ARC, but code to generate visual puzzles. 1D puzzles first.☆17Updated 7 months ago
- Mistral7B playing DOOM☆28Updated last year
- entropix style sampling + GUI☆25Updated 5 months ago
- ☆80Updated 2 months ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆29Updated 9 months ago
- a curated list of data for reasoning ai☆132Updated 7 months ago
- Letting Claude Code develop his own MCP tools :)☆91Updated 3 weeks ago
- Simple demo showing how to use the Forge API by Nous Research☆11Updated 4 months ago
- A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API☆31Updated 3 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆138Updated last month
- this is a TypeScript-based MCP server that implements a simple loom and makes it available for Claude to use.☆19Updated 3 months ago
- Flexible, efficient, and context-aware generation from large unstructured knowledge sources.☆15Updated 10 months ago
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆32Updated last week
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆65Updated 3 months ago