lucidrains / Adan-pytorchLinks

Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch

☆252

Alternatives and similar repositories for Adan-pytorch

Users that are interested in Adan-pytorch are comparing it to the libraries listed below

Sorting:

archinetai / surgeon-pytorch
A library to inspect and extract intermediate layers of PyTorch models.
☆475Updated 3 years ago
lucidrains / x-unet
Implementation of a U-net complete with efficient attention as well as the latest research findings
☆287Updated last year
lucidrains / Mega-pytorch
Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena
☆206Updated 2 years ago
lucidrains / flash-cosine-sim-attention
Implementation of fused cosine similarity attention in the same style as Flash Attention
☆217Updated 2 years ago
brohrer / sharpened-cosine-similarity
An alternative to convolution in neural networks
☆257Updated last year
HomebrewML / revlib
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
☆131Updated 3 years ago
lucidrains / nystrom-attention
Implementation of Nyström Self-attention, from the paper Nyströmformer
☆141Updated 7 months ago
AminRezaei0x443 / memory-efficient-attention
Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch
☆182Updated 2 years ago
infocusp / diffusion_models
Minimal standalone example of diffusion model
☆160Updated 3 years ago
lessw2020 / Ranger21
Ranger deep learning optimizer rewrite to use newest components
☆338Updated last year
lucidrains / recurrent-interface-network-pytorch
Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…
☆204Updated last year
hristo-vrigazov / mmap.ninja
Memory mapped numpy arrays of varying shapes
☆303Updated last year
lucidrains / memory-efficient-attention-pytorch
Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"
☆383Updated 2 years ago
yiyixuxu / denoising-diffusion-flax
Implementing the Denoising Diffusion Probabilistic Model in Flax
☆151Updated 2 years ago
iejMac / video2numpy
Optimized library for large-scale extraction of frames and audio from video.
☆204Updated 2 years ago
lucidrains / pytorch-custom-utils
Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…
☆123Updated last year
ctlllll / SGConv
☆164Updated 2 years ago
facebookresearch / torchdim
Named tensors with first-class dimensions for PyTorch
☆331Updated 2 years ago
jiaweizzhao / ZerO-initialization
☆75Updated 2 years ago
DarshanDeshpande / jax-models
Unofficial JAX implementations of deep learning research papers
☆158Updated 3 years ago
nocotan / pytorch-lightning-gans
Collection of PyTorch Lightning implementations of Generative Adversarial Network varieties presented in research papers.
☆169Updated 7 months ago
lucidrains / fast-transformer-pytorch
Implementation of Fast Transformer in Pytorch
☆177Updated 4 years ago
google-research / diffstride
TF/Keras code for DiffStride, a pooling layer with learnable strides.
☆124Updated 3 years ago
lucidrains / block-recurrent-transformer-pytorch
Implementation of Block Recurrent Transformer - Pytorch
☆221Updated last year
facebookresearch / FFCV-SSL
FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning.
☆209Updated 2 years ago
rasbt / cyclemoid-pytorch
Cyclemoid implementation for PyTorch
☆90Updated 3 years ago
unixpickle / sk2torch
Convert scikit-learn models to PyTorch modules
☆166Updated last year
meta-pytorch / torcheval
A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…
☆241Updated last month
Rayhane-mamah / Efficient-VDVAE
Official Pytorch and JAX implementation of "Efficient-VDVAE: Less is more"
☆196Updated 3 years ago
lucidrains / ema-pytorch
A simple way to keep track of an Exponential Moving Average (EMA) version of your Pytorch model
☆616Updated 10 months ago