SakanaAI / DiscoPOP
Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆177 Updated 7 months ago
Alternatives and similar repositories for DiscoPOP:
Users who are interested in DiscoPOP are comparing it to the libraries listed below.
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models ☆55 Updated 7 months ago
- smolLM with Entropix sampler on pytorch ☆148 Updated 2 months ago
- ☆96 Updated 3 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆282 Updated 3 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆119 Updated this week
- σ-GPT: A New Approach to Autoregressive Models ☆61 Updated 5 months ago
- ☆46 Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆27 Updated 10 months ago
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models ☆67 Updated last month
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆158 Updated 2 weeks ago
- CycleQD is a framework for parameter space model merging. ☆29 Updated last month
- 🤖 A collection of AI agents includes research papers, blogs, and products focused on developing autonomous systems. ☆52 Updated 7 months ago
- General multi-task deep RL Agent ☆174 Updated 7 months ago
- Code for ExploreTom ☆71 Updated last month
- Alice in Wonderland code base for experiments and raw experiments data ☆120 Updated this week
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral ☆17 Updated 2 weeks ago
- ☆98 Updated last month
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆291 Updated last month
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀 ☆92 Updated 4 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆120 Updated this week
- Automating the Search for Artificial Life with Foundation Models! ☆348 Updated 2 weeks ago
- ☆12 Updated 6 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization' ☆177 Updated last month
- An automated tool for discovering insights from research paper corpora ☆136 Updated 7 months ago
- ☆60 Updated last year
- GRadient-INformed MoE ☆261 Updated 4 months ago
- ☆80 Updated 3 weeks ago
- TPI-LLM: Serving 70b-scale LLMs Efficiently on Low-resource Edge Devices ☆168 Updated 2 months ago