HomebrewML / revlibLinks

Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload

☆131

Alternatives and similar repositories for revlib

Users that are interested in revlib are comparing it to the libraries listed below

Sorting:

facebookresearch / torchdim
Named tensors with first-class dimensions for PyTorch
☆332Updated 2 years ago
lucidrains / Adan-pytorch
Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch
☆252Updated 3 years ago
lucidrains / flash-cosine-sim-attention
Implementation of fused cosine similarity attention in the same style as Flash Attention
☆218Updated 2 years ago
HomebrewML / HomebrewNLP-torch
A case study of efficient training of large language models using commodity hardware.
☆68Updated 3 years ago
AminRezaei0x443 / memory-efficient-attention
Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch
☆184Updated 2 years ago
rwightman / efficientnet-jax
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax
☆129Updated last year
DarshanDeshpande / jax-models
Unofficial JAX implementations of deep learning research papers
☆159Updated 3 years ago
jiaweizzhao / ZerO-initialization
☆75Updated 3 years ago
lucidrains / panoptic-transformer
Another attempt at a long-context / efficient transformer by me
☆38Updated 3 years ago
rom1504 / gpu-tester
gpu tester detects broken and slow gpus in a cluster
☆72Updated 2 years ago
lixilinx / psgd_torch
Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…
☆188Updated last month
michaelsdr / momentumnet
Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities
☆208Updated last year
SirRob1997 / Crowded-Valley---Results
This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"
☆184Updated 4 years ago
lucidrains / ponder-transformer
Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper
☆81Updated 4 years ago
jxbz / fromage
🧀 Pytorch code for the Fromage optimiser.
☆129Updated last year
lucidrains / mlp-gpt-jax
A GPT, made only of MLPs, in Jax
☆58Updated 4 years ago
Felix-Petersen / algovision
Differentiable Algorithms and Algorithmic Supervision.
☆116Updated 2 years ago
lucidrains / Mega-pytorch
Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena
☆207Updated 2 years ago
facebookresearch / diffq
DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight …
☆237Updated 2 years ago
lucidrains / nystrom-attention
Implementation of Nyström Self-attention, from the paper Nyströmformer
☆141Updated 8 months ago
lucidrains / feedback-transformer-pytorch
Implementation of Feedback Transformer in Pytorch
☆108Updated 4 years ago
n2cholas / jax-resnet
Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).
☆117Updated 3 years ago
lucidrains / glom-pytorch
An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates concepts from neural fields, top-down-bottom-up proc…
☆194Updated 4 years ago
OATML / RHO-Loss
☆209Updated 3 years ago
tmbdev-archive / webdataset-lightning
A small demonstration of using WebDataset with ImageNet and PyTorch Lightning
☆75Updated last year
brohrer / sharpened-cosine-similarity
An alternative to convolution in neural networks
☆258Updated last year
lucidrains / PaLM-jax
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)
☆189Updated 3 years ago
lucidrains / g-mlp-gpt
GPT, but made only out of MLPs
☆89Updated 4 years ago
kingoflolz / CLIP_JAX
Contrastive Language-Image Pretraining
☆144Updated 3 years ago
lucidrains / hourglass-transformer-pytorch
Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI
☆97Updated 3 years ago