VictorTaelin / ab_challenge_eval
Evaluator for the A::B Prompting Challenge
☆26 · Updated 7 months ago
Related projects
Alternatives and complementary repositories for ab_challenge_eval
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers ☆71 · Updated 3 months ago
- Grounding LLM mathematical reasoning with proof assistants. ☆60 · Updated last year
- look how they massacred my boy ☆58 · Updated last month
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 · Updated 10 months ago
- This repository explains and provides examples for "concept anchoring" in GPT4. ☆72 · Updated 10 months ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs. ☆100 · Updated this week
- a curated list of data for reasoning ai ☆113 · Updated 3 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆56 · Updated 3 weeks ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆60 · Updated 6 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆57 · Updated 4 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆152 · Updated last week
- A repo to evaluate various LLMs' chess-playing abilities. ☆69 · Updated 7 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX ☆49 · Updated 2 weeks ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆113 · Updated 3 weeks ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search. ☆22 · Updated last month
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you… ☆58 · Updated last month
- Fluid Database ☆114 · Updated 2 months ago
- A library for building software agents using behavior trees and language models. ☆75 · Updated 6 months ago
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches. ☆236 · Updated 3 weeks ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications ☆30 · Updated 5 months ago
- Fast parallel LLM inference for MLX ☆149 · Updated 4 months ago
- entropix style sampling + GUI ☆25 · Updated 3 weeks ago
- Replace expensive LLM calls with finetunes automatically ☆62 · Updated 9 months ago