kyegomez / swarms-pytorchLinks
Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch π
β132Updated last week
Alternatives and similar repositories for swarms-pytorch
Users that are interested in swarms-pytorch are comparing it to the libraries listed below
Sorting:
- Code for the paper "What's the Magic Word? A Control Theory of LLM Prompting"β110Updated last year
- A reinforcement learning framework based on MLX.β241Updated 2 weeks ago
- Ο-GPT: A New Approach to Autoregressive Modelsβ68Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"β102Updated 10 months ago
- β112Updated last year
- General multi-task deep RL Agentβ185Updated last year
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.β41Updated last year
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in theirβ¦β155Updated 2 weeks ago
- run paligemma in real timeβ133Updated last year
- Collection of autoregressive model implementationβ86Updated 6 months ago
- β136Updated last year
- β36Updated 2 months ago
- Memoria is a human-inspired memory architecture for neural networks.β76Updated last year
- β81Updated last year
- smolLM with Entropix sampler on pytorchβ150Updated 11 months ago
- β123Updated last year
- β124Updated 10 months ago
- β28Updated last year
- Simple GRPO scripts and configurations.β59Updated 8 months ago
- Training small GPT-2 style models using Kolmogorov-Arnold networks.β121Updated last year
- Mixing Language Models with Self-Verification and Meta-Verificationβ109Updated 10 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingnessβ57Updated 9 months ago
- An automated tool for discovering insights from research papaer corporaβ138Updated last year
- Plotting (entropy, varentropy) for small LMsβ98Updated 5 months ago
- β46Updated 2 years ago
- look how they massacred my boyβ63Updated last year
- Clean RL implementation using MLXβ33Updated last year
- The history files when recording human interaction while solving ARC tasksβ117Updated 2 weeks ago
- β40Updated last year
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fastβ150Updated last year