aidanmclaughlin / AidanBench
Aidan Bench attempts to measure <big_model_smell> in LLMs.
☆96Updated this week
Related projects ⓘ
Alternatives and complementary repositories for AidanBench
- look how they massacred my boy☆58Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆113Updated 3 weeks ago
- smolLM with Entropix sampler on pytorch☆139Updated 2 weeks ago
- ☆94Updated last month
- Simple Transformer in Jax☆119Updated 4 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆151Updated this week
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated last month
- smol models are fun too☆77Updated last week
- ☆104Updated 8 months ago
- ☆66Updated 2 weeks ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆55Updated 2 weeks ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆282Updated last month
- ☆74Updated 3 weeks ago
- code for training & evaluating Contextual Document Embedding models☆117Updated this week
- ☆101Updated 3 months ago
- Just a bunch of benchmark logs for different LLMs☆114Updated 3 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆48Updated 2 weeks ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆49Updated 3 weeks ago
- ☆20Updated 2 weeks ago
- run embeddings in MLX☆73Updated last month
- An introduction to LLM Sampling☆64Updated last week
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆81Updated last year
- ☆48Updated last year
- Sphynx Hallucination Induction☆48Updated 3 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆60Updated 6 months ago
- train entropix like a champ!☆20Updated last month
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆63Updated this week
- This repository explains and provides examples for "concept anchoring" in GPT4.☆72Updated 10 months ago
- ☆118Updated 3 months ago
- An automated tool for discovering insights from research papaer corpora☆135Updated 5 months ago