☆27Aug 25, 2023Updated 2 years ago
Alternatives and similar repositories for CocktailSGD
Users that are interested in CocktailSGD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)☆19May 28, 2024Updated 2 years ago
- ☆13May 4, 2026Updated last month
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 3 years ago
- This repository is the official implementation of 'EDEN: Communication-Efficient and Robust Distributed Mean Estimation for Federated Lea…☆18May 5, 2026Updated last month
- This repository contains code for the MicroAdam paper.☆21Dec 14, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆10Apr 29, 2024Updated 2 years ago
- Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727☆151Oct 29, 2024Updated last year
- summer school materials☆46Aug 4, 2023Updated 2 years ago
- ☆13May 25, 2022Updated 4 years ago
- ☆12Mar 31, 2020Updated 6 years ago
- The official implementation of TinyTrain [ICML '24]☆27Jul 19, 2024Updated last year
- crystalnet -- a mini core AI library (being refactored, see https://github.com/lgarithm/stdnn-ops)☆17Oct 1, 2019Updated 6 years ago
- Implementation of the FedPM framework by the authors of the ICLR 2023 paper "Sparse Random Networks for Communication-Efficient Federated…☆31Feb 10, 2023Updated 3 years ago
- ☆31Jul 22, 2025Updated 10 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Code for paper "Byzantine-Resilient Decentralized Stochastic Optimization with Robust Aggregation Rules"☆20Apr 19, 2024Updated 2 years ago
- ☆19May 4, 2023Updated 3 years ago
- Aho-Corasick automation for large-scale multi-pattern matching. Available for C/C++, Python, and Java on Linux, macOS, and Windows.☆14Oct 29, 2024Updated last year
- Code for paper "Byzantine-Resilient Distributed Finite-Sum Optimization over Networks"☆18Nov 5, 2020Updated 5 years ago
- List Flower resources☆12Feb 4, 2022Updated 4 years ago
- Official implementation for Text Generation Beyond Discrete Token Sampling☆25Aug 11, 2025Updated 9 months ago
- ☆10Jun 19, 2023Updated 2 years ago
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 3 years ago
- Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele…☆11Mar 3, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆151Jun 2, 2023Updated 3 years ago
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆47Apr 21, 2026Updated last month
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆21Jan 24, 2025Updated last year
- Blog post☆17Feb 16, 2024Updated 2 years ago
- ☆28Jul 18, 2025Updated 10 months ago
- [ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections"☆16May 31, 2024Updated 2 years ago
- fast trainer for educational purposes☆26Updated this week
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated 2 years ago
- [ICLR24] Better Neural PDE Solvers Through Data-Free Mesh Movers☆17Mar 20, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Grams: Gradient Descent with Adaptive Momentum Scaling (ICLR 2025 Workshop)☆17Mar 6, 2025Updated last year
- A PyTorch native library for training speculative decoding models☆154Updated this week
- Implementation of (overlap) local SGD in Pytorch☆34Jul 12, 2020Updated 5 years ago
- ☆18Apr 8, 2026Updated 2 months ago
- Code related to ’Beyond spectral gap: The role of the topology in decentralized learning‘.☆14Jun 7, 2022Updated 4 years ago
- Resources regarding evML (edge verified machine learning)☆23Jan 4, 2025Updated last year
- Efficient misspecification uncertainties for linear regression☆18May 19, 2026Updated 3 weeks ago