SakanaAI / DiscoPOP
Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆184Updated 9 months ago
Alternatives and similar repositories for DiscoPOP:
Users that are interested in DiscoPOP are comparing it to the libraries listed below
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆61Updated 9 months ago
- smolLM with Entropix sampler on pytorch☆150Updated 4 months ago
- ☆97Updated 5 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆121Updated last month
- Code for ExploreTom☆76Updated 3 months ago
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆93Updated 6 months ago
- 🤖 A collection of AI agents includes research papers, blogs, and products focused on developing autonomous systems.☆55Updated 9 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆167Updated last month
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆296Updated 4 months ago
- Draw more samples☆186Updated 8 months ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆159Updated last month
- General multi-task deep RL Agent☆177Updated 9 months ago
- The history files when recording human interaction while solving ARC tasks☆97Updated this week
- ☆101Updated 2 months ago
- Automating the Search for Artificial Life with Foundation Models!☆387Updated 2 months ago
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models☆69Updated 2 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆139Updated last month
- PyTorch implementation of models from the Zamba2 series.☆177Updated last month
- Alice in Wonderland code base for experiments and raw experiments data☆128Updated last month
- ☆98Updated 6 months ago
- ☆60Updated last year
- Video+code lecture on building nanoGPT from scratch☆65Updated 9 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆165Updated last week
- ☆12Updated 8 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆28Updated 11 months ago