Pytorch script hot swap: Change code without unloading your LLM from VRAM
☆125Apr 21, 2025Updated last year
Alternatives and similar repositories for training-hot-swap
Users that are interested in training-hot-swap are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NanoGPT-speedrunning for the poor T4 enjoyers☆74Apr 22, 2025Updated last year
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- Framework for specifying and proving properties—such as robustness, fairness, and interpretability—of machine learning models using Lean …☆83Mar 16, 2026Updated 2 months ago
- ☆11Apr 30, 2025Updated last year
- A browser-based, WebGL2 implementation of GPT-2 with transform block and attention matrix visualization☆341Oct 24, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is a python implementation for stitching images.☆231Oct 3, 2024Updated last year
- ☆48Apr 2, 2025Updated last year
- A flat container abstraction for Rust☆17Nov 24, 2025Updated 6 months ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- ☆40Dec 1, 2022Updated 3 years ago
- Small, simple agent task environments for training and evaluation☆20Nov 1, 2024Updated last year
- R.L. methods and techniques.☆199Updated this week
- Research framework that quantifies how steganographic obfuscation of embeddings defeats off-the-shelf statistical detection in RAG pipeli…☆74May 19, 2026Updated 3 weeks ago
- ☆21Mar 3, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Fast Polar Decomposition for Muon☆152May 2, 2026Updated last month
- CECS 342 Lab 4: Logic Languages with SWI-Prolog☆13Nov 19, 2021Updated 4 years ago
- Triton kernels for Flux☆23Jul 7, 2025Updated 11 months ago
- Prompts and evaluation data for LLMs on real world coding and writing tasks☆17Sep 13, 2025Updated 8 months ago
- Code to go along with Separating Axis Test blog☆58Jul 19, 2025Updated 10 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆39Jan 4, 2025Updated last year
- Official Github repository for Neural Film Grain Rendering☆19Feb 3, 2026Updated 4 months ago
- LLMProc: Unix-inspired runtime that treats LLMs as processes.☆34Jul 17, 2025Updated 10 months ago
- Cholidean Harmony Structure☆33Apr 16, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆23Jan 5, 2025Updated last year
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆153Sep 12, 2025Updated 8 months ago
- Writing FLUX in Triton☆42Sep 22, 2024Updated last year
- [ECCV 2024] BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion☆21Jul 2, 2024Updated last year
- Various handy scripts to quickly setup new Linux and Windows sandboxes, containers and WSL.☆40May 23, 2026Updated 2 weeks ago
- Focused on fast experimentation and simplicity☆79Dec 24, 2024Updated last year
- Visualize the intermediate output of Mistral 7B☆394Jan 22, 2025Updated last year
- [AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?☆26Aug 5, 2025Updated 10 months ago
- Efficient optimizers☆330May 13, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- code for training & evaluating Contextual Document Embedding models☆205May 14, 2025Updated last year
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit …☆361May 21, 2025Updated last year
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance…☆156Apr 7, 2025Updated last year
- Official PyTorch Implementation for Meaning Representations from Trajectories in Autoregressive Models (ICLR 2024)☆22May 14, 2024Updated 2 years ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆30Jul 24, 2025Updated 10 months ago
- An AI character interaction system with emotional modeling and advanced memory management☆17Oct 26, 2024Updated last year
- ☆23Mar 25, 2024Updated 2 years ago