Grams: Gradient Descent with Adaptive Momentum Scaling (ICLR 2025 Workshop)
☆17Mar 6, 2025Updated last year
Alternatives and similar repositories for Grams
Users that are interested in Grams are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Mar 2, 2025Updated last year
- ☆13May 4, 2026Updated 3 weeks ago
- ☆22Nov 21, 2024Updated last year
- [ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models☆39Nov 4, 2025Updated 6 months ago
- Socks5 Proxy based on Websocket.☆14Jul 10, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization (IEEE TPAMI 2021)☆17Jun 4, 2021Updated 4 years ago
- Mono.CSharp with edits for Unity modding☆12Jul 27, 2020Updated 5 years ago
- Useful utilities for huggingface☆25Dec 26, 2025Updated 4 months ago
- Swiftly get tons of images from indexed tars on Huggingface☆79Dec 19, 2024Updated last year
- Fractional Spike Differential Equations Neural Network with Efficient Adjoint Parameters Training☆16Aug 6, 2025Updated 9 months ago
- Code related to ’Beyond spectral gap: The role of the topology in decentralized learning‘.☆14Jun 7, 2022Updated 3 years ago
- Efficient misspecification uncertainties for linear regression☆18Updated this week
- succinct and unrestricted reflection☆14Mar 3, 2023Updated 3 years ago
- ☆36Dec 7, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- torch implementation of diloco☆23May 31, 2024Updated last year
- Continue to develop the dnSpy project☆12Apr 19, 2022Updated 4 years ago
- Pytorch implementation of paper: Small Pre-trained Language Models Can be Fine-tuned as Large Models via Over-Parameterization.☆12May 18, 2023Updated 3 years ago
- The code for paper Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models.☆13Apr 10, 2024Updated 2 years ago
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆437Dec 12, 2024Updated last year
- SuperCLUE高考作文机器自动阅卷系统☆19Jun 8, 2023Updated 2 years ago
- ☆19Jun 9, 2021Updated 4 years ago
- Mille Crepe Bench: layer-wise performance analysis for deep learning frameworks.☆18Oct 22, 2019Updated 6 years ago
- PyTorch optimizer based on nonlinear conjugate gradient method☆31Apr 25, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Repository for Deep Learning Theory papers☆15Jan 24, 2024Updated 2 years ago
- Repository for CPU Kernel Generation for LLM Inference☆28Jul 13, 2023Updated 2 years ago
- Easily share your custom workflows for anyone to run☆22Oct 17, 2024Updated last year
- Godot 4.0 插件,用于 B 站直播弹幕信息提取☆14Jan 16, 2023Updated 3 years ago
- Prompt Studio MidJourney提示词可视化编辑与管理工具☆28Apr 25, 2026Updated last month
- ☆36Mar 12, 2025Updated last year
- The code for Differentiable Linearized ADMM (ICML 2019)☆36Oct 9, 2019Updated 6 years ago
- ☆28Aug 1, 2025Updated 9 months ago
- The code for the paper "QuAFL: Federated Averaging Can Be Both Asynchronous and Communication-Efficient"☆17Mar 26, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A collection of niche / personally useful PyTorch optimizers with modified code.☆28Apr 14, 2026Updated last month
- xast utility to build feeds (rss, atom)☆10Jul 19, 2023Updated 2 years ago
- Generate v4 UUIDs using libsodium's RNG☆11Jun 16, 2020Updated 5 years ago
- source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"☆36Jun 20, 2024Updated last year
- Generate a menu with selectable menu items as a string☆12Dec 26, 2018Updated 7 years ago
- Application level sharding made simple!☆10Mar 25, 2017Updated 9 years ago
- The official implementation of TinyTrain [ICML '24]☆27Jul 19, 2024Updated last year