SakanaAI / DiscoPOP
Code for Discovering Preference Optimization Algorithms with and for Large Language Models
☆166Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for DiscoPOP
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆51Updated 5 months ago
- smolLM with Entropix sampler on pytorch☆139Updated 3 weeks ago
- ☆95Updated last month
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆111Updated last week
- 🤖 A collection of AI agents includes research papers, blogs, and products focused on developing autonomous systems.☆44Updated 5 months ago
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models☆64Updated last month
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆15Updated last month
- PyTorch implementation of models from the Zamba2 series.☆158Updated this week
- ☆54Updated this week
- ☆93Updated last month
- Low-Rank adapter extraction for fine-tuned transformers model☆162Updated 6 months ago
- Function Calling Benchmark & Testing☆74Updated 4 months ago
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀☆93Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆25Updated 8 months ago
- ☆72Updated this week
- σ-GPT: A New Approach to Autoregressive Models☆59Updated 3 months ago
- Draw more samples☆179Updated 4 months ago
- An automated tool for discovering insights from research papaer corpora☆135Updated 5 months ago
- entropix style sampling + GUI☆25Updated 3 weeks ago
- ☆104Updated 8 months ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆157Updated 9 months ago
- ☆12Updated 4 months ago
- ☆64Updated 5 months ago
- look how they massacred my boy☆58Updated last month
- ☆83Updated 2 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆38Updated 5 months ago
- ☆18Updated last month
- General multi-task deep RL Agent☆165Updated 5 months ago
- Mamba training library developed by kotoba technologies☆68Updated 9 months ago
- Video+code lecture on building nanoGPT from scratch☆64Updated 5 months ago