andyk / headlongLinks
A framework for collecting a large human-sourced chain-of-thoughts dataset
☆27Updated last year
Alternatives and similar repositories for headlong
Users that are interested in headlong are comparing it to the libraries listed below
Sorting:
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated 5 months ago
- ☆67Updated 8 months ago
- Foyle is a copilot to help developers deploy and operate their applications.☆133Updated 10 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- ☆73Updated last week
- ☆67Updated 6 months ago
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆88Updated 11 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆102Updated 6 months ago
- ☆51Updated 5 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆65Updated 9 months ago
- look how they massacred my boy☆63Updated last year
- Verbosity control for AI agents☆66Updated last year
- Chat Markup Language conversation library☆55Updated 2 years ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated last year
- ☆40Updated last year
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆59Updated last year
- lossily compress representation vectors using product quantization☆59Updated 3 months ago
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…☆27Updated 8 months ago
- ☆19Updated last year
- Code for the paper "What's the Magic Word? A Control Theory of LLM Prompting"☆111Updated last year
- Train your own SOTA deductive reasoning model☆107Updated 11 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆189Updated last week
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆107Updated 4 months ago
- Sphynx Hallucination Induction☆53Updated last year
- ScalarLM - a unified training and inference stack☆96Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- ☆37Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆90Updated last month
- An AI character interaction system with emotional modeling and advanced memory management☆17Updated last year
- ☆85Updated last year