VictorTaelin / ab_challenge_eval
Evaluator for the A::B Prompting Challenge
☆26Updated 10 months ago
Alternatives and similar repositories for ab_challenge_eval:
Users that are interested in ab_challenge_eval are comparing it to the libraries listed below
- This repository explains and provides examples for "concept anchoring" in GPT4.☆72Updated last year
- Certified Reasoning with Language Models☆31Updated last year
- look how they massacred my boy☆63Updated 4 months ago
- a curated list of data for reasoning ai☆128Updated 6 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- ☆80Updated last month
- Grounding LLM mathematical reasoning with proof assistants.☆63Updated last year
- The open-source implementation of Q*, achieved in context as a zero-shot reprogramming of the attention mechanism. (synthetic data)Updated 2 months ago
- ☆111Updated 2 months ago
- ☆24Updated last year
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆50Updated 3 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆30Updated 2 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆164Updated this week
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated 9 months ago
- A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API☆30Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆129Updated this week
- ☆49Updated 8 months ago
- Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆28Updated last year
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆22Updated 3 weeks ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆64Updated 2 months ago
- Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generate…☆36Updated 5 months ago
- ☆20Updated 3 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 4 months ago
- ☆37Updated 6 months ago
- Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.☆245Updated 3 months ago
- Mistral7B playing DOOM☆28Updated 10 months ago
- Flexible, efficient, and context-aware generation from large unstructured knowledge sources.☆15Updated 9 months ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated last year
- Distributed Inference for mlx LLm☆82Updated 6 months ago