kyegomez / swarms-pytorch
Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch
☆122 · Updated last month
Alternatives and similar repositories for swarms-pytorch:
Users who are interested in swarms-pytorch are comparing it to the libraries listed below:
- Code for the paper "What's the Magic Word? A Control Theory of LLM Prompting" ☆106 · Updated 10 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆99 · Updated 4 months ago
- ☆112 · Updated last year
- Collection of autoregressive model implementation ☆85 · Updated 2 weeks ago
- ☆129 · Updated 8 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆171 · Updated this week
- A reinforcement learning framework based on MLX. ☆233 · Updated 2 months ago
- The history files when recording human interaction while solving ARC tasks ☆109 · Updated last week
- run paligemma in real time ☆131 · Updated 11 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ☆42 · Updated 11 months ago
- ☆27 · Updated 10 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness ☆45 · Updated 3 months ago
- ☆48 · Updated last year
- ☆81 · Updated last year
- ☆117 · Updated last month
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆82 · Updated last year
- A Collection of Pydantic Models to Abstract IRL ☆18 · Updated 2 weeks ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines ☆198 · Updated last year
- ☆61 · Updated last year
- σ-GPT: A New Approach to Autoregressive Models ☆64 · Updated 8 months ago
- ☆97 · Updated 6 months ago
- Set of scripts to finetune LLMs ☆37 · Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆231 · Updated 6 months ago
- ☆31 · Updated 2 months ago
- Cerule - A Tiny Mighty Vision Model ☆67 · Updated 8 months ago
- Memoria is a human-inspired memory architecture for neural networks. ☆71 · Updated 6 months ago
- A MAD laboratory to improve AI architecture designs 🧪 ☆114 · Updated 4 months ago
- look how they massacred my boy ☆63 · Updated 6 months ago
- inference code for mixtral-8x7b-32kseqlen ☆100 · Updated last year
- ☆38 · Updated 9 months ago