Enealor/PyTorch-SM3

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Enealor/PyTorch-SM3)

Enealor / PyTorch-SM3

Implements the SM3-II adaptive optimization algorithm for PyTorch.

☆33

Alternatives and similar repositories for PyTorch-SM3

Users that are interested in PyTorch-SM3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dariush-bahrami / gravity.optimizer
View on GitHub
Deep Learning Gravity Optimizer Source Code Repository
☆14Jul 26, 2021Updated 5 years ago
AranKomat / Metroplex
View on GitHub
☆21Mar 15, 2023Updated 3 years ago
NZ99 / transformer_in_transformer_flax
View on GitHub
☆21Mar 14, 2021Updated 5 years ago
OliverRichter / normalized-attention
View on GitHub
Code publication to the paper "Normalized Attention Without Probability Cage"
☆17Nov 9, 2021Updated 4 years ago
acmi-lab / pretraining-with-nonsense
View on GitHub
Pretraining summarization models using a corpus of nonsense
☆13Sep 28, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kinoshitadaisuke / ncu_astroinformatics_202209
View on GitHub
The repository for the course "Astroinformatics" offered at Institute of Astronomy, National Central University, from Sep/2022 to Jan/202…
☆10Jun 4, 2024Updated 2 years ago
EleutherAI / magiCARP
View on GitHub
One stop shop for all things carp
☆58Sep 9, 2022Updated 3 years ago
nestordemeure / AdaHessianJax
View on GitHub
Jax implementation of the AdaHessian optimizer
☆19Mar 11, 2021Updated 5 years ago
facebookresearch / task_bench
View on GitHub
The TaskBench500 dataset and code for generating tasks.
☆16Jul 16, 2022Updated 4 years ago
lucidrains / ponder-transformer
View on GitHub
Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper
☆84Oct 30, 2021Updated 4 years ago
rsonthal / TreeRep
View on GitHub
Learning Tree structures and Tree metrics
☆24Aug 8, 2024Updated last year
lucidrains / mlp-gpt-jax
View on GitHub
A GPT, made only of MLPs, in Jax
☆59Jun 23, 2021Updated 5 years ago
GitGyun / chameleon
View on GitHub
[ECCV'24 Oral] Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild
☆13Mar 13, 2025Updated last year
halcy / tpuddim
View on GitHub
☆22May 3, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
qdu1995 / DQSD
View on GitHub
☆11Jun 27, 2021Updated 5 years ago
sooheon / hangul-utils
View on GitHub
A Clojure library for deconstructing Korean unicode syllable characters into alphabet characters
☆10Nov 22, 2021Updated 4 years ago
abhibambhaniya / progressive_gradient_flow_nm_sparsity
View on GitHub
Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers".
☆11Feb 5, 2024Updated 2 years ago
Medabid1 / I2I-LR
View on GitHub
Official pytorch implementation of I2I translation with low resolution conditioning
☆23Sep 2, 2021Updated 4 years ago
tyler-hayes / Stream-51
View on GitHub
The Stream-51 dataset for streaming classification and novelty detection from videos.
☆17Feb 22, 2022Updated 4 years ago
Optimization-AI / SogCLR
View on GitHub
Stochastic Optimization for Global Contrastive Learning without Large Mini-batches
☆20Mar 31, 2023Updated 3 years ago
ChrisHayduk / qlora-multi-gpu
View on GitHub
QLoRA with Enhanced Multi GPU Support
☆38Aug 8, 2023Updated 2 years ago
kanezaki / MIRO
View on GitHub
☆13Apr 16, 2018Updated 8 years ago
istoony / winograd-convolutional-nn
View on GitHub
I'm going to use the Winograd’s minimal ﬁltering algorithms to introduce a new class of fast algorithms for convolutional neural networks…
☆12Mar 22, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
noveens / distill_cf
View on GitHub
[ NeurIPS '22 ] Data distillation for recommender systems. Shows equivalent performance with 2-3 orders less data.
☆24Jun 8, 2023Updated 3 years ago
qslim / epcb-gnns
View on GitHub
☆11Jun 21, 2022Updated 4 years ago
luminus-framework / luminus-immutant
View on GitHub
Immutant adapter for Luminus
☆10Sep 12, 2020Updated 5 years ago
PAL-ML / PEARL_v1
View on GitHub
☆30Jan 17, 2022Updated 4 years ago
conda-forge / jaxlib-feedstock
View on GitHub
A conda-smithy repository for jaxlib.
☆17Jul 3, 2026Updated 3 weeks ago
jiangdada1221 / DrugOrchestra
View on GitHub
☆11Nov 11, 2023Updated 2 years ago
sebastianrisi / ga-world-models
View on GitHub
☆20Jul 16, 2019Updated 7 years ago
lambdaisland / kaocha-junit-xml
View on GitHub
JUnit XML output for Kaocha
☆15Oct 2, 2025Updated 9 months ago
jderiu / spot-the-bot-code
View on GitHub
☆13Mar 1, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
SeongwoongCho / adversarial-autoaugment-pytorch
View on GitHub
Unofficial Pytorch Implementation Of AdversarialAutoAugment(ICLR2020)
☆21Feb 9, 2021Updated 5 years ago
zhoudaquan / Refiner_ViT
View on GitHub
☆110Sep 15, 2021Updated 4 years ago
thu-ml / 2by4-pretrain-acc-examples
View on GitHub
Code for "Accelerating Transformer Pre-training with 2:4 Sparsity"
☆28Dec 8, 2024Updated last year
exalearn / covid-drug-design
View on GitHub
Code and analyses related to the ExaLearn drug design efforts
☆11Sep 30, 2020Updated 5 years ago
alexandonian / contrastive-feature-loss
View on GitHub
PyTorch implementation of Contrastive Feature Loss for Image Prediction (AIM Workshop at ICCV 2021)
☆55Nov 19, 2021Updated 4 years ago
facebookresearch / CCQA
View on GitHub
CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training
☆33Jul 20, 2022Updated 4 years ago
yucornetto / GG-Transformer
View on GitHub
Code and models for the paper Glance-and-Gaze Vision Transformer
☆28Jun 7, 2021Updated 5 years ago