toshas / torch-discounted-cumsumLinks

Fast Discounted Cumulative Sums in PyTorch

☆96

Alternatives and similar repositories for torch-discounted-cumsum

Users that are interested in torch-discounted-cumsum are comparing it to the libraries listed below

Sorting:

lucidrains / ponder-transformer
Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper
☆81Updated 3 years ago
lucidrains / compressive-transformer-pytorch
Pytorch implementation of Compressive Transformers, from Deepmind
☆162Updated 3 years ago
ischlag / fast-weight-transformers
Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.
☆105Updated 4 years ago
lucidrains / HTM-pytorch
Implementation of Hierarchical Transformer Memory (HTM) for Pytorch
☆75Updated 3 years ago
lucidrains / gated-state-spaces-pytorch
Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch
☆101Updated 2 years ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆50Updated 3 years ago
choidami / sst
☆50Updated 4 years ago
lucidrains / mlp-gpt-jax
A GPT, made only of MLPs, in Jax
☆58Updated 4 years ago
lucidrains / feedback-transformer-pytorch
Implementation of Feedback Transformer in Pytorch
☆107Updated 4 years ago
clemkoa / ntm
Neural Turing Machines in pytorch
☆48Updated 3 years ago
mgrankin / minGPT
minGPT in JAX
☆48Updated 3 years ago
srush / torch-queue
☆68Updated last year
harvardnlp / genbmm
CUDA kernels for generalized matrix-multiplication in PyTorch
☆85Updated 3 years ago
RobertCsordas / transformer_generalization
The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…
☆67Updated 2 years ago
davda54 / ada-hessian
Easy-to-use AdaHessian optimizer (PyTorch)
☆79Updated 4 years ago
n2cholas / jax-resnet
Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).
☆112Updated 3 years ago
ssnl / PyTorch-Reparam-Module
Reparameterize your PyTorch modules
☆71Updated 4 years ago
ColinQiyangLi / AdaCat
AdaCat
☆49Updated 3 years ago
lucidrains / product-key-memory
Standalone Product Key Memory module in Pytorch - for augmenting Transformer models
☆82Updated last year
lucidrains / Mega-pytorch
Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena
☆204Updated last year
cpcp1998 / PermuteFormer
Code for the paper PermuteFormer
☆42Updated 3 years ago
microsoft / ReinMax
Beyond Straight-Through
☆100Updated 2 years ago
HazyResearch / structured-nets
Structured matrices for compressing neural networks
☆67Updated last year
jxbz / fromage
🧀 Pytorch code for the Fromage optimiser.
☆125Updated last year
HomebrewML / HomebrewNLP-torch
A case study of efficient training of large language models using commodity hardware.
☆68Updated 3 years ago
lucidrains / g-mlp-gpt
GPT, but made only out of MLPs
☆89Updated 4 years ago
lucidrains / einops-exts
Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️
☆55Updated 2 years ago
Felix-Petersen / algovision
Differentiable Algorithms and Algorithmic Supervision.
☆115Updated 2 years ago
davisyoshida / lorax
LoRA for arbitrary JAX models and functions
☆140Updated last year
shyamsn97 / hyper-nn
Easy Hypernetworks in Pytorch and Jax
☆103Updated 2 years ago