Pytorch script hot swap: Change code without unloading your LLM from VRAM
☆125Apr 21, 2025Updated 10 months ago
Alternatives and similar repositories for training-hot-swap
Users that are interested in training-hot-swap are comparing it to the libraries listed below
Sorting:
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Apr 22, 2025Updated 10 months ago
- Tensor library & inference framework for machine learning☆116Oct 3, 2025Updated 5 months ago
- Framework for specifying and proving properties—such as robustness, fairness, and interpretability—of machine learning models using Lean …☆80Jul 30, 2025Updated 7 months ago
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆98Dec 5, 2024Updated last year
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated last year
- CECS 342 Lab 4: Logic Languages with SWI-Prolog☆13Nov 19, 2021Updated 4 years ago
- This is a python implementation for stitching images.☆231Oct 3, 2024Updated last year
- A browser-based, WebGL2 implementation of GPT-2 with transform block and attention matrix visualization☆342Oct 24, 2025Updated 4 months ago
- ☆36Feb 6, 2026Updated last month
- An AI character interaction system with emotional modeling and advanced memory management☆17Oct 26, 2024Updated last year
- [AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?☆25Aug 5, 2025Updated 7 months ago
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆21Jun 15, 2025Updated 8 months ago
- Recording and thinking when read the paper about PersonReID.☆10Jan 10, 2019Updated 7 years ago
- A minimal tensor processing unit (TPU), inspired by Google's TPU V2 and V1☆1,181Mar 1, 2026Updated last week
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆145Sep 12, 2025Updated 5 months ago
- Code to go along with Separating Axis Test blog☆59Jul 19, 2025Updated 7 months ago
- Efficient optimizers☆285Dec 20, 2025Updated 2 months ago
- ☆16Nov 21, 2017Updated 8 years ago
- ☆18Apr 10, 2023Updated 2 years ago
- Simple function for local patch extraction from OpenCV keypoints.☆14Jul 10, 2019Updated 6 years ago
- Triton kernels for Flux☆22Jul 7, 2025Updated 8 months ago
- ☆19Mar 25, 2025Updated 11 months ago
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated 2 years ago
- RISC-V assembler/simulator with GUI☆14Jul 31, 2022Updated 3 years ago
- Animating R1's thoughts.☆382Feb 17, 2025Updated last year
- ☆17Apr 14, 2023Updated 2 years ago
- This repository contains the implementation of **Alternators**, a novel family of generative models for time-dependent data.☆35Jun 6, 2025Updated 9 months ago
- A dashboard for exploring timm learning rate schedulers☆19Nov 22, 2024Updated last year
- The Oceanographic Multi-purpose Software Environment: a package for multi-physics and multi-scale earth science simulations.☆19Sep 3, 2024Updated last year
- A love2d module that enables hot code reloading☆15Sep 5, 2017Updated 8 years ago
- Code to test the raw overhead of a JNI call, as opposed to calling the method from C☆19Feb 18, 2011Updated 15 years ago
- Supercharge huggingface transformers with model parallelism.☆78Jul 23, 2025Updated 7 months ago
- Focused on fast experimentation and simplicity☆80Dec 24, 2024Updated last year
- ☆128May 26, 2025Updated 9 months ago
- ☆1,078May 18, 2025Updated 9 months ago
- us cached road graph, freeways, primary and secondary roads☆193Jan 8, 2025Updated last year
- ☆23Mar 25, 2024Updated last year
- R.L. methods and techniques.☆199Feb 28, 2026Updated last week
- Encrypted environment variables☆185Nov 10, 2021Updated 4 years ago