kyegomez / swarms-pytorch
Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch
☆134 · Updated last month
Alternatives and similar repositories for swarms-pytorch
Users interested in swarms-pytorch are comparing it to the libraries listed below.
- Code for the paper "What's the Magic Word? A Control Theory of LLM Prompting" ☆110 · Updated last year
- ☆112 · Updated 2 years ago
- Collection of autoregressive model implementation ☆85 · Updated 7 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆103 · Updated 11 months ago
- A reinforcement learning framework based on MLX. ☆246 · Updated last week
- Memoria is a human-inspired memory architecture for neural networks. ☆78 · Updated last year
- ☆62 · Updated last year
- General multi-task deep RL Agent ☆185 · Updated last year
- run paligemma in real time ☆133 · Updated last year
- ☆28 · Updated last year
- ☆136 · Updated last year
- Simple GRPO scripts and configurations. ☆59 · Updated 10 months ago
- ☆82 · Updated last year
- ☆164 · Updated last year
- σ-GPT: A New Approach to Autoregressive Models ☆70 · Updated last year
- OMNI: Open-endedness via Models of human Notions of Interestingness ☆57 · Updated 10 months ago
- ☆45 · Updated 2 years ago
- ☆86 · Updated last year
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their… ☆157 · Updated last month
- Genetics for Language Models ☆17 · Updated last year
- An automated tool for discovering insights from research paper corpora ☆137 · Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks. ☆122 · Updated last year
- ☆129 · Updated 11 months ago
- Plotting (entropy, varentropy) for small LMs ☆99 · Updated 6 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆232 · Updated last year
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ☆41 · Updated last year
- ☆40 · Updated last year
- ☆36 · Updated 4 months ago
- ☆94 · Updated 2 years ago
- ☆138 · Updated 3 months ago