VictorTaelin / ab_challenge_evalLinks
Evaluator for the A::B Prompting Challenge
☆27Updated last year
Alternatives and similar repositories for ab_challenge_eval
Users that are interested in ab_challenge_eval are comparing it to the libraries listed below
Sorting:
- explore token trajectory trees on instruct and base models☆125Updated last week
- Grounding LLM mathematical reasoning with proof assistants.☆62Updated last year
- look how they massacred my boy☆63Updated 7 months ago
- This repository explains and provides examples for "concept anchoring" in GPT4.☆72Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated last year
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management☆64Updated 4 months ago
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆32Updated 2 months ago
- A graph visualization of attention☆55Updated 2 weeks ago
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.☆259Updated 7 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆50Updated 7 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆69Updated 3 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 7 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 3 months ago
- Plotting (entropy, varentropy) for small LMs☆97Updated 2 weeks ago
- ☆50Updated last month
- Simple demo showing how to use the Forge API by Nous Research☆11Updated 6 months ago
- anything you want can be built with morph cloud☆12Updated last month
- Approximating the joint distribution of language models via MCTS☆21Updated 7 months ago
- ☆54Updated 4 months ago
- ☆114Updated 5 months ago
- ☆111Updated 5 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆140Updated 3 months ago
- a curated list of data for reasoning ai☆136Updated 10 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆172Updated last week
- ☆83Updated 5 months ago
- smol models are fun too☆93Updated 6 months ago
- The open-source implementation of Q*, achieved in context as a zero-shot reprogramming of the attention mechanism. (synthetic data)☆1Updated 5 months ago
- Certified Reasoning with Language Models☆31Updated last year
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated 2 years ago
- LLM verified with Monte Carlo Tree Search☆275Updated 2 months ago