Grams: Gradient Descent with Adaptive Momentum Scaling (ICLR 2025 Workshop)
☆17 · Mar 6, 2025 · Updated last year
Alternatives and similar repositories for Grams
Users that are interested in Grams are comparing it to the libraries listed below.
- ☆15 · Mar 2, 2025 · Updated last year
- ☆13 · Jan 15, 2025 · Updated last year
- [AAAI-25 Oral] Adaptive Calibration ☆15 · Jul 6, 2025 · Updated 8 months ago
- Original DVS128 Gesture Dataset in PyTorch ☆13 · Jun 6, 2023 · Updated 2 years ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning ☆10 · Apr 28, 2023 · Updated 2 years ago
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025 ☆18 · May 28, 2025 · Updated 9 months ago
- Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization (IEEE TPAMI 2021) ☆17 · Jun 4, 2021 · Updated 4 years ago
- LLM training in simple, raw C/CUDA ☆15 · Dec 5, 2024 · Updated last year
- Examples to control the Opal C1 from within python. ☆17 · May 7, 2023 · Updated 2 years ago
- ☆256 · Dec 2, 2024 · Updated last year
- Repository for "CIRA Guide to Custom Loss Functions for Neural Networks in Environmental Sciences" ☆17 · Jun 17, 2021 · Updated 4 years ago
- [ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections" ☆16 · May 31, 2024 · Updated last year
- Fractional Spike Differential Equations Neural Network with Efficient Adjoint Parameters Training ☆16 · Aug 6, 2025 · Updated 7 months ago
- The code for the paper "Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models". ☆13 · Apr 10, 2024 · Updated last year
- Code for "Practical Low-Rank Communication Compression in Decentralized Deep Learning" ☆17 · Aug 4, 2020 · Updated 5 years ago
- Efficient misspecification uncertainties for linear regression ☆17 · Updated this week
- torch implementation of diloco ☆22 · May 31, 2024 · Updated last year
- PyTorch implementation of the paper "Small Pre-trained Language Models Can be Fine-tuned as Large Models via Over-Parameterization". ☆12 · May 18, 2023 · Updated 2 years ago
- ☆33 · Apr 22, 2025 · Updated 11 months ago
- SuperCLUE automated grading system for Gaokao (college entrance exam) essays ☆18 · Jun 8, 2023 · Updated 2 years ago
- ☆23 · Jan 5, 2025 · Updated last year
- This repository contains code for the MicroAdam paper. ☆21 · Dec 14, 2024 · Updated last year
- A PyTorch implementation of Adafactor (https://arxiv.org/pdf/1804.04235.pdf) ☆26 · Aug 27, 2019 · Updated 6 years ago
- Mille Crepe Bench: layer-wise performance analysis for deep learning frameworks. ☆18 · Oct 22, 2019 · Updated 6 years ago
- Course Project for CS224W at Stanford ☆22 · Dec 10, 2021 · Updated 4 years ago
- MLX implementation of xLSTM model by Beck et al. (2024) ☆32 · Jun 5, 2024 · Updated last year
- ☆21 · Apr 12, 2024 · Updated last year
- [ICCV2023] Towards Memory- and Time-Efficient Backpropagation for Training Spiking Neural Networks ☆45 · Aug 2, 2023 · Updated 2 years ago
- Repository for Deep Learning Theory papers ☆15 · Jan 24, 2024 · Updated 2 years ago
- Code for "The Expressive Power of Low-Rank Adaptation". ☆20 · Apr 19, 2024 · Updated last year
- A Python Data Valuation Package ☆34 · Feb 3, 2023 · Updated 3 years ago
- ☆35 · Mar 12, 2025 · Updated last year
- The official implementation of the paper "Does Federated Learning Really Need Backpropagation?" ☆23 · Feb 9, 2023 · Updated 3 years ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds ☆364 · Nov 15, 2025 · Updated 4 months ago
- xast utility to build feeds (rss, atom) ☆10 · Jul 19, 2023 · Updated 2 years ago
- [OUTDATED] Third-party Windows, Linux, Mac, Android, and iOS client for the University of Science and Technology Beijing course selection system ☆19 · Sep 30, 2016 · Updated 9 years ago
- Generate v4 UUIDs using libsodium's RNG ☆11 · Jun 16, 2020 · Updated 5 years ago
- ☆27 · Aug 25, 2023 · Updated 2 years ago
- The official implementation of TinyTrain [ICML '24] ☆24 · Jul 19, 2024 · Updated last year