SakanaAI / DiscoPOP
Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆186Updated 10 months ago
Alternatives and similar repositories for DiscoPOP:
Users that are interested in DiscoPOP are comparing it to the libraries listed below
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆61Updated 10 months ago
- smolLM with Entropix sampler on pytorch☆151Updated 6 months ago
- Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"☆104Updated 3 months ago
- ☆97Updated 6 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆307Updated 6 months ago
- ☆12Updated 10 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆98Updated 2 months ago
- Finetune Llama-3-8b on the MathInstruct dataset☆110Updated 6 months ago
- CycleQD is a framework for parameter space model merging.☆39Updated 3 months ago
- An AI benchmark for creative, human-like problem solving using Sudoku variants☆43Updated this week
- The implementation of "Leeroo Orchestrator: Elevating LLMs Performance Through Model Integration"☆55Updated last year
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models☆70Updated 4 months ago
- ☆65Updated 3 weeks ago
- GRadient-INformed MoE☆262Updated 7 months ago
- ☆80Updated 4 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆122Updated last month
- Getting crystal-like representations with harmonic loss☆183Updated last month
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Updated 4 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆92Updated 2 weeks ago
- ☆47Updated 4 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆29Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 3 months ago
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆24Updated 3 months ago
- Train your own SOTA deductive reasoning model☆92Updated 2 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆145Updated 3 months ago
- ☆109Updated 4 months ago
- ☆129Updated 8 months ago
- Automating the Search for Artificial Life with Foundation Models!☆409Updated 4 months ago
- 0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" i…☆96Updated last year
- Code for ExploreTom☆83Updated 4 months ago