DeMoriarty / TorchPQ

Approximate nearest neighbor search with product quantization on GPU in PyTorch and CUDA

☆228

Alternatives and similar repositories for TorchPQ

Users who are interested in TorchPQ are comparing it to the libraries listed below.

lucidrains / sinkhorn-transformer
Sinkhorn Transformer - Practical implementation of Sparse Sinkhorn Attention
☆268 Updated 4 years ago
NVIDIA / transformer-ls
Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).
☆228 Updated 3 years ago
DeMoriarty / fast_pytorch_kmeans
This is a pytorch implementation of k-means clustering algorithm
☆333 Updated 8 months ago
lucidrains / linformer
Implementation of Linformer for Pytorch
☆301 Updated last year
mlpen / Nystromformer
☆385 Updated 2 years ago
pytorch / nestedtensor
[Prototype] Tools for the concurrent manipulation of variably sized Tensors.
☆251 Updated 3 years ago
lucidrains / routing-transformer
Fully featured implementation of Routing Transformer
☆296 Updated 4 years ago
google-research / diffstride
TF/Keras code for DiffStride, a pooling layer with learnable strides.
☆124 Updated 3 years ago
facebookresearch / mega
Sequence modeling with Mega.
☆301 Updated 2 years ago
huggingface / pytorch_block_sparse
Fast Block Sparse Matrices for Pytorch
☆547 Updated 4 years ago
lucidrains / memory-efficient-attention-pytorch
Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"
☆383 Updated 2 years ago
ctlllll / SGConv
☆164 Updated 2 years ago
lucidrains / local-attention
An implementation of local windowed attention for language modeling
☆484 Updated 4 months ago
lucidrains / flash-cosine-sim-attention
Implementation of fused cosine similarity attention in the same style as Flash Attention
☆216 Updated 2 years ago
facebookresearch / diffq
DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight …
☆236 Updated 2 years ago
LeviViana / torch_sampling
Efficient reservoir sampling implementation for PyTorch
☆107 Updated 4 years ago
ppwwyyxx / RAM-multiprocess-dataloader
Demystify RAM Usage in Multi-Process Data Loaders
☆204 Updated 2 years ago
AminRezaei0x443 / memory-efficient-attention
Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch
☆183 Updated 2 years ago
utsaslab / MONeT
MONeT framework for reducing memory consumption of DNN training
☆174 Updated 4 years ago
cybertronai / pytorch-lamb
Implementation of https://arxiv.org/abs/1904.00962
☆377 Updated 4 years ago
ptillet / torch-blocksparse
Block-sparse primitives for PyTorch
☆160 Updated 4 years ago
lukemelas / do-you-even-need-attention
Is the attention layer even necessary? (https://arxiv.org/abs/2105.02723)
☆483 Updated 4 years ago
kaiyuyue / torchshard
Slicing a PyTorch Tensor Into Parallel Shards
☆301 Updated 5 months ago
kakaobrain / torchlars
A LARS implementation in PyTorch
☆352 Updated 5 years ago
HomebrewML / revlib
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
☆131 Updated 3 years ago
prigoyal / pytorch_memonger
Experimental ground for optimizing memory of pytorch models
☆365 Updated 7 years ago
lucidrains / triton-transformer
Implementation of a Transformer, but completely in Triton
☆276 Updated 3 years ago
lucidrains / linear-attention-transformer
Transformer based on a variant of attention that is linear complexity in respect to sequence length
☆811 Updated last year
facebookresearch / torchdim
Named tensors with first-class dimensions for PyTorch
☆331 Updated 2 years ago
google-research / long-range-arena
Long Range Arena for Benchmarking Efficient Transformers
☆767 Updated last year