Minimal PyTorch implementation of TP, SP, FSDP and sharded-EMA
☆32Nov 27, 2025Updated 6 months ago
Alternatives and similar repositories for FSDP-Training
Users that are interested in FSDP-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Offical implementation of "Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation" (AAAI2025 Oral)☆42Jan 14, 2026Updated 4 months ago
- [ICLR 2024] Official implementation of Spiking Graph Contrastive Learning (0️⃣1️⃣ SpikeGCL)☆33May 8, 2024Updated 2 years ago
- The main repository for the ICUAS 2022 UAV competition.☆18Jul 4, 2022Updated 3 years ago
- Official PyTorch implementation of "GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance" (ICML 2025)☆51Apr 13, 2026Updated last month
- ☆31Aug 18, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆16May 28, 2025Updated last year
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆134Jun 24, 2025Updated 11 months ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- KeepGPU is a simple CLI app that keeps your GPUs running.☆36Mar 9, 2026Updated 3 months ago
- ☆17Feb 16, 2024Updated 2 years ago
- ☆34Oct 13, 2025Updated 7 months ago
- Odysseus: Playground of LLM Sequence Parallelism☆80Jun 17, 2024Updated last year
- Motion planning algorithms commonly used on autonomous vehicles. (path planning + path tracking)☆25Nov 18, 2020Updated 5 years ago
- (NeurIPS 2025) Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation☆73May 21, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- POPGym Library in JAX☆14Apr 15, 2024Updated 2 years ago
- Integrates Imbue's Cost Aware pareto-Region Bayesian Search (CARBS) with Weights and Biases (WanDB)☆12Mar 17, 2025Updated last year
- ☆22Sep 16, 2025Updated 8 months ago
- Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023…☆13Dec 5, 2023Updated 2 years ago
- BFloat16 Fused Adam Operator for PyTorch☆19Nov 16, 2024Updated last year
- ☆10Jun 27, 2024Updated last year
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…☆12Feb 23, 2024Updated 2 years ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆29Aug 19, 2025Updated 9 months ago
- ☆11Jan 21, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official code of "AutoSNN: Towards Energy-Efficient Spiking Neural Networks," ICML22☆18May 29, 2022Updated 4 years ago
- ☆20Mar 18, 2026Updated 2 months ago
- Visual Question Answering System☆11Nov 13, 2019Updated 6 years ago
- Parallel Prefix Sum (Scan) with CUDA☆29Jun 22, 2024Updated last year
- ☆15Mar 2, 2025Updated last year
- Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023☆12Jun 20, 2025Updated 11 months ago
- ☆16Sep 22, 2024Updated last year
- ☆12Aug 26, 2022Updated 3 years ago
- ☆16Jul 16, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Log-Polar Space Convolution for Convolutional Neural Networks☆13Dec 12, 2022Updated 3 years ago
- 文本数据挖掘大作业,分别用朴素贝叶斯,SVM,情感词典,LSTM,textcnn实现情感分析☆16Jun 16, 2023Updated 2 years ago
- Quantization in the Jagged Loss Landscape of Vision Transformers☆13Oct 22, 2023Updated 2 years ago
- Deep Reinforcement Learning for Autonomous Drone Navigation☆41Oct 31, 2025Updated 7 months ago
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices☆12Jul 1, 2021Updated 4 years ago
- Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023☆22Mar 12, 2026Updated 2 months ago
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆17Nov 24, 2025Updated 6 months ago