A compressed adaptive optimizer for training large-scale deep learning models using PyTorch
☆25Nov 26, 2019Updated 6 years ago
Alternatives and similar repositories for Count-Sketch-Optimizers
Users that are interested in Count-Sketch-Optimizers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MISSION: Ultra Large-Scale Feature Selection using Count-Sketches☆13Oct 6, 2019Updated 6 years ago
- Sketched SGD☆28Jul 4, 2020Updated 5 years ago
- Simple Hierarchical Count Sketch in Python☆21Jun 3, 2021Updated 4 years ago
- Layer-wise Sparsification of Distributed Deep Learning☆10Jul 6, 2020Updated 5 years ago
- PyTorch for benchmarking communication-efficient distributed SGD optimization algorithms☆78Aug 30, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Sparsified SGD with Memory: https://arxiv.org/abs/1809.07599☆58Oct 25, 2018Updated 7 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆17Oct 11, 2021Updated 4 years ago
- Code for testing DCT plus Sparse (DCTpS) networks☆14Jun 15, 2021Updated 4 years ago
- Code for reproducing experiments performed for Accoridon☆13Jun 11, 2021Updated 4 years ago
- ☆21Mar 7, 2024Updated 2 years ago
- gTop-k S-SGD: A Communication-Efficient Distributed Synchronous SGD Algorithm for Deep Learning☆37Aug 19, 2019Updated 6 years ago
- A Sparse-tensor Communication Framework for Distributed Deep Learning☆13Nov 1, 2021Updated 4 years ago
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…☆27Dec 10, 2022Updated 3 years ago
- SGD with compressed gradients and error-feedback: https://arxiv.org/abs/1901.09847☆32Jul 25, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Running massive simulations using RNNs on CPUs for building bots and all kinds of things.☆13Jun 13, 2021Updated 4 years ago
- Multiclass and multilabel datasets in Vowpal Wabbit format☆12Apr 17, 2018Updated 8 years ago
- implement distributed machine learning with Pytorch + OpenMPI☆53Mar 22, 2019Updated 7 years ago
- ☆13Mar 22, 2023Updated 3 years ago
- Source mirror of OpenPopulous, converted as per notes repo (2013-09-10)☆17Sep 10, 2013Updated 12 years ago
- Source code for SIGIR 2022 paper.☆16Apr 25, 2022Updated 4 years ago
- ☆13Oct 15, 2022Updated 3 years ago
- Code repository for "Spatiotemporal Traffic Matrix Synthesis", Paul Tune and Matthew Roughan, ACM SIGCOMM 2015, London, UK, August 2015.☆15Jan 13, 2016Updated 10 years ago
- ☆10Sep 3, 2017Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings☆157Jul 23, 2024Updated last year
- A Neovim client for VsCoq 2 vscoqtop.☆14Nov 8, 2025Updated 6 months ago
- LearnedSketch: Learning-Based Frequency Estimation Algorithms (ICLR 2019)☆32Mar 24, 2023Updated 3 years ago
- Code for PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization, NeurIPS 2022☆18Nov 23, 2022Updated 3 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆16May 28, 2025Updated 11 months ago
- Code for "Practical Low-Rank Communication Compression in Decentralized Deep Learning"☆17Aug 4, 2020Updated 5 years ago
- A framework for index based similarity search.☆20May 10, 2019Updated 7 years ago
- bigcomputing☆33Nov 3, 2020Updated 5 years ago
- Convolutional 3D autoencoder☆14Aug 21, 2016Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Emotional probes for Gemma 4 E4B☆31Apr 8, 2026Updated last month
- Multi-index hashing for the resolution of ANN search problem on large datasets☆15Oct 16, 2018Updated 7 years ago
- Code for the signSGD paper☆94Jan 12, 2021Updated 5 years ago
- Dark channel Haze removal algorithm with CUDA acceleration (typically 10x or more speedup using a Nvidia GPU)☆14Dec 7, 2017Updated 8 years ago
- Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)☆182Nov 19, 2018Updated 7 years ago
- first attempt at description2code from 2016☆10Nov 15, 2018Updated 7 years ago
- FedNew: A Communication-Efficient and Privacy-Preserving Newton-Type Method for Federated Learning☆17Jun 2, 2022Updated 3 years ago