☆27 · Aug 25, 2023 · Updated 2 years ago
Alternatives and similar repositories for CocktailSGD
Users who are interested in CocktailSGD compare it to the libraries listed below.
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024) ☆19 · May 28, 2024 · Updated last year
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning ☆10 · Apr 28, 2023 · Updated 2 years ago
- Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees**. ☆28 · Apr 25, 2023 · Updated 2 years ago
- ☆13 · Jan 15, 2025 · Updated last year
- This repository is the official implementation of 'EDEN: Communication-Efficient and Robust Distributed Mean Estimation for Federated Lea… ☆14 · Aug 2, 2022 · Updated 3 years ago
- ☆19 · May 4, 2023 · Updated 2 years ago
- crystalnet -- a mini core AI library (being refactored, see https://github.com/lgarithm/stdnn-ops) ☆17 · Oct 1, 2019 · Updated 6 years ago
- This repository contains code for the MicroAdam paper. ☆21 · Dec 14, 2024 · Updated last year
- Utilities for Training Very Large Models ☆58 · Sep 25, 2024 · Updated last year
- Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727 ☆149 · Oct 29, 2024 · Updated last year
- Official Implementation of APB (ACL 2025 main Oral) and Spava. ☆33 · Jan 30, 2026 · Updated last month
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25] ☆41 · May 13, 2025 · Updated 9 months ago
- GPU operators for sparse tensor operations ☆35 · Mar 11, 2024 · Updated last year
- JAX/Flax implementation of the Hyena Hierarchy ☆34 · Apr 27, 2023 · Updated 2 years ago
- An efficient implementation of the NSA (Native Sparse Attention) kernel ☆129 · Jun 24, 2025 · Updated 8 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models" ☆10 · Jul 19, 2024 · Updated last year
- A python algorithm to change the pitch of the voice in real time ☆13 · Dec 13, 2020 · Updated 5 years ago
- ☆150 · Jun 2, 2023 · Updated 2 years ago
- ☆54 · Dec 17, 2025 · Updated 2 months ago
- CVPR 2023: PAniC-3D, Vtubers dataset downloader ☆13 · Apr 22, 2023 · Updated 2 years ago
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs ☆11 · Jan 13, 2023 · Updated 3 years ago
- Efficient misspecification uncertainties for linear regression ☆16 · Feb 19, 2026 · Updated last week
- ☆16 · Jul 23, 2023 · Updated 2 years ago
- netbeacon - monitoring your network capture, NIDS or network analysis process ☆19 · Oct 26, 2013 · Updated 12 years ago
- Code accompanying the NeurIPS 2019 paper AutoAssist: A Framework to Accelerate Training of Deep Neural Networks. ☆14 · Oct 3, 2022 · Updated 3 years ago
- (READ ONLY MIRROR) The ProB Model Checker and Animator Plugin for Rodin ☆19 · Updated this week
- A Cython library to solve the Bittensor registration POW on CUDA ☆15 · Aug 15, 2025 · Updated 6 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. … ☆13 · Dec 4, 2024 · Updated last year
- Phonemes and durations labeling based on whisper small ☆11 · Jul 7, 2024 · Updated last year
- ICLR 2023: Learning to Extrapolate: A Transductive Approach ☆11 · Aug 15, 2023 · Updated 2 years ago
- JAX Scalify: end-to-end scaled arithmetics ☆18 · Oct 30, 2024 · Updated last year
- Proxify Molotov.tv DRM to share content publicly ☆10 · Jun 24, 2020 · Updated 5 years ago
- Offline RandomAPI npm module ☆12 · Apr 22, 2018 · Updated 7 years ago
- EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation ☆27 · Jul 30, 2025 · Updated 6 months ago
- Python library for real-time vote prediction ☆11 · Mar 7, 2021 · Updated 4 years ago
- 🛠 Robust SSH: auto-reconnect SSH session that preserves your running shell and command. Intuitive, no server-side setup, aimed at simplic… ☆13 · Nov 14, 2025 · Updated 3 months ago
- How to plot for papers, slides, demos, etc. ☆10 · Apr 7, 2022 · Updated 3 years ago
- the indexer and search engine for irchiver, see https://irchiver.com for license and other information ☆14 · Dec 2, 2021 · Updated 4 years ago
- Drastically Reducing the Number of Trainable Parameters in Deep CNNs by Inter-layer Kernel-sharing ☆14 · Mar 28, 2023 · Updated 2 years ago