SakanaAI / DiscoPOP
Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆188 Updated 11 months ago
Alternatives and similar repositories for DiscoPOP
Users who are interested in DiscoPOP compare it to the libraries listed below.
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models ☆62 Updated 11 months ago
- smolLM with Entropix sampler on pytorch ☆150 Updated 7 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆310 Updated 7 months ago
- Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models" ☆106 Updated 4 months ago
- An AI benchmark for creative, human-like problem solving using Sudoku variants ☆67 Updated last month
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆121 Updated this week
- CycleQD is a framework for parameter space model merging. ☆40 Updated 4 months ago
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models ☆70 Updated 5 months ago
- Plotting (entropy, varentropy) for small LMs ☆97 Updated 2 weeks ago
- Draw more samples ☆191 Updated 11 months ago
- ☆83 Updated 5 months ago
- ☆68 Updated last month
- PyTorch implementation of models from the Zamba2 series. ☆182 Updated 4 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods. ☆24 Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆100 Updated 3 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆172 Updated 4 months ago
- ☆47 Updated 5 months ago
- Getting crystal-like representations with harmonic loss ☆187 Updated 2 months ago
- Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI ☆29 Updated 6 months ago
- Source code for the collaborative reasoner research project at Meta FAIR. ☆87 Updated last month
- ☆70 Updated last month
- Code for ExploreTom ☆83 Updated 5 months ago
- Train your own SOTA deductive reasoning model ☆93 Updated 3 months ago
- The history files when recording human interaction while solving ARC tasks ☆110 Updated 2 weeks ago
- Unofficial entropix implementation for Gemma2, Llama, Qwen2, and Mistral ☆17 Updated 4 months ago
- Token Omission Via Attention ☆126 Updated 7 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ☆41 Updated last year
- A project for evaluating LLMs on Japanese tasks ☆83 Updated last week
- The implementation of "Leeroo Orchestrator: Elevating LLMs Performance Through Model Integration" ☆54 Updated last year
- EvaByte: Efficient Byte-level Language Models at Scale ☆101 Updated last month