kyegomez / swarms-pytorch
Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch
★132 · Updated 3 weeks ago
Alternatives and similar repositories for swarms-pytorch
Users who are interested in swarms-pytorch are comparing it to the libraries listed below.
- Code for the paper "What's the Magic Word? A Control Theory of LLM Prompting" ★110 · Updated last year
- ★28 · Updated last year
- Memoria is a human-inspired memory architecture for neural networks. ★77 · Updated last year
- A reinforcement learning framework based on MLX. ★242 · Updated last month
- ★112 · Updated last year
- ★136 · Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ★103 · Updated 10 months ago
- Collection of autoregressive model implementation ★86 · Updated 6 months ago
- Training small GPT-2 style models using Kolmogorov-Arnold networks. ★121 · Updated last year
- run paligemma in real time ★133 · Updated last year
- General multi-task deep RL Agent ★185 · Updated last year
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ★41 · Updated last year
- Simple GRPO scripts and configurations. ★59 · Updated 9 months ago
- ★94 · Updated 2 years ago
- ★61 · Updated last year
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their… ★156 · Updated last month
- ★81 · Updated last year
- OMNI: Open-endedness via Models of human Notions of Interestingness ★57 · Updated 9 months ago
- σ-GPT: A New Approach to Autoregressive Models ★68 · Updated last year
- smolLM with Entropix sampler on pytorch ★150 · Updated last year
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ★181 · Updated last week
- Exploration into the proposed architecture from Sapient Intelligence of Singapore 🇸🇬 ★70 · Updated 2 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ★107 · Updated 8 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration ★65 · Updated 9 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ★231 · Updated last year
- ★40 · Updated last year
- ★124 · Updated last year
- ★45 · Updated 2 years ago
- This repository contains a better implementation of Kolmogorov-Arnold networks ★63 · Updated 5 months ago
- ★36 · Updated 3 months ago