apd10 / universal_memory_allocation

☆15

Alternatives and similar repositories for universal_memory_allocation

Users who are interested in universal_memory_allocation are comparing it to the libraries listed below.

HazyResearch / mongoose
A Learnable LSH Framework for Efficient NN Training
☆33Updated 4 years ago
chentingpc / dpq_embedding_compression
Differentiable Product Quantization for End-to-End Embedding Compression.
☆63Updated 2 years ago
hp027 / AliGraph
Large Scale Graphical Model
☆24Updated 6 years ago
ClashLuke / PerfTorch
High performance pytorch modules
☆18Updated 2 years ago
xinyandai / product-quantization
Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner P…
☆59Updated 4 years ago
facebookresearch / tbsm
Time-based Sequence Model for Personalization and Recommendation Systems
☆49Updated 4 years ago
usyd-fsalab / NeuralNetworkRandomness
☆14Updated 3 years ago
cuMF / culda_cgs
Efficient LDA solution on GPUs.
☆24Updated 7 years ago
facebookresearch / FBTT-Embedding
This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as …
☆194Updated 3 years ago
tt-embedding / tt-embeddings
☆27Updated 6 years ago
HazyResearch / anchor-stability
A study of the downstream instability of word embeddings
☆12Updated 3 years ago
mwydmuch / napkinXC
Extremely simple and fast extreme multi-class and multi-label classifiers.
☆70Updated 7 months ago
lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49Updated 3 years ago
wouterkool / ancestral-gumbel-top-k-sampling
Ancestral Gumbel-Top-k Sampling
☆25Updated 5 years ago
TalSchuster / CATs
Confident Adaptive Transformers
☆13Updated 4 years ago
PanZaifeng / G-SLIDE
☆15Updated 3 years ago
RobertCsordas / moe_layer
sigma-MoE layer
☆20Updated last year
prajjwal1 / fluence
A deep learning library based on PyTorch, focused on low-resource language research and robustness
☆70Updated 3 years ago
rlin27 / DeBut
Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.
☆13Updated 4 years ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆50Updated 3 years ago
noveens / distill_cf
[ NeurIPS '22 ] Data distillation for recommender systems. Shows equivalent performance with 2-3 orders less data.
☆23Updated 2 years ago
sustcsonglin / gated_linear_attention_layer
☆31Updated last year
ahennequ / pytorch-custom-mma
☆29Updated 3 years ago
antofuller / configaformers
A python library for highly configurable transformers - easing model architecture search and experimentation.
☆49Updated 3 years ago
petuum / tuun
Hyperparameter tuning via uncertainty modeling
☆48Updated last year
arogozhnikov / adamw_bfloat16
AdamW optimizer for bfloat16 models in pytorch 🔥.
☆37Updated last year
hpcaitech / CachedEmbedding
A memory efficient DLRM training solution using ColossalAI
☆106Updated 2 years ago
lucidrains / esbn-transformer
An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols
☆16Updated 4 years ago
juntang-zhuang / ACProp-Optimizer
Implementation for ACProp (Momentum centering and asynchronous update for adaptive gradient methods, NeurIPS 2021)
☆16Updated 4 years ago
biswajitsc / sparse-embed
Code for paper 'Minimizing FLOPs to Learn Efficient Sparse Representations' published at ICLR 2020
☆19Updated 5 years ago