kyegomez / swarms-pytorch
Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch
☆121 Updated this week
Alternatives and similar repositories for swarms-pytorch
Users that are interested in swarms-pytorch are comparing it to the libraries listed below
- Code for the paper "What's the Magic Word? A Control Theory of LLM Prompting" ☆106 Updated 11 months ago
- ☆112 Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆100 Updated 5 months ago
- ☆130 Updated 9 months ago
- ☆27 Updated 10 months ago
- ☆80 Updated last year
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs. ☆41 Updated last year
- ☆111 Updated 5 months ago
- Collection of autoregressive model implementation ☆85 Updated last month
- The history files when recording human interaction while solving ARC tasks ☆110 Updated last week
- ☆53 Updated last year
- OMNI: Open-endedness via Models of human Notions of Interestingness ☆49 Updated 4 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆64 Updated 9 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆172 Updated last week
- ☆60 Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆231 Updated 7 months ago
- smolLM with Entropix sampler on pytorch ☆150 Updated 7 months ago
- ☆95 Updated 4 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆81 Updated last year
- ☆48 Updated last year
- Full finetuning of large language models without large memory requirements ☆93 Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers ☆66 Updated last month
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines ☆197 Updated last year
- ☆22 Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks. ☆117 Updated last year
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… ☆62 Updated 7 months ago
- A Collection of Pydantic Models to Abstract IRL ☆18 Updated last week
- look how they massacred my boy ☆63 Updated 7 months ago
- Set of scripts to finetune LLMs ☆37 Updated last year
- ☆49 Updated last year