lessw2020 / Ranger22Links
Testing various improvements to Ranger21 for 2022
☆19Updated last year
Alternatives and similar repositories for Ranger22
Users that are interested in Ranger22 are comparing it to the libraries listed below
Sorting:
- Another attempt at a long-context / efficient transformer by me☆38Updated 3 years ago
- Implementation of Nyström Self-attention, from the paper Nyströmformer☆144Updated 9 months ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆219Updated 2 years ago
- ☆75Updated 3 years ago
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆116Updated 3 years ago
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆57Updated 3 years ago
- Implementation of LogAvgExp for Pytorch☆37Updated 8 months ago
- Axial Positional Embedding for Pytorch☆84Updated 10 months ago
- A collection of optimizers, some arcane others well known, for Flax.☆29Updated 4 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload☆131Updated 3 years ago
- An open source implementation of CLIP.☆33Updated 3 years ago
- ☆41Updated 4 years ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆98Updated 4 years ago
- Layerwise Batch Entropy Regularization☆24Updated 3 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆127Updated 2 years ago
- A PyTorch Lightning extension that accelerates and enhances foundation model experimentation with flexible fine-tuning schedules.☆67Updated 2 weeks ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆183Updated 7 months ago
- PyTorch interface for TrueGrad Optimizers☆43Updated 2 years ago
- Framework for creating (partially) reversible neural networks with PyTorch☆155Updated 3 years ago
- Utilities for Training Very Large Models☆58Updated last year
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆79Updated 5 years ago
- A convolution-free, transformer-only version of the CycleGAN framework☆33Updated 3 years ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆46Updated 2 years ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 3 years ago
- A case study of efficient training of large language models using commodity hardware.☆68Updated 3 years ago
- MaskedTensors for PyTorch☆38Updated 3 years ago
- Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch☆184Updated 3 years ago
- Simple CIFAR-10 classification with ConvMixer☆45Updated 3 years ago