tt-embedding / tt-embeddingsLinks

☆27

Alternatives and similar repositories for tt-embeddings

Users that are interested in tt-embeddings are comparing it to the libraries listed below

Sorting:

asappresearch / flop
Pytorch library for factorized L0-based pruning.
☆45Updated last year
KhrulkovV / tt-pytorch
☆60Updated 5 years ago
szhangtju / The-compression-of-Transformer
☆64Updated 4 years ago
chentingpc / dpq_embedding_compression
Differentiable Product Quantization for End-to-End Embedding Compression.
☆62Updated 2 years ago
Noahs-ARK / RFA
☆33Updated 4 years ago
JetRunner / PABEE
Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".
☆65Updated 4 years ago
microsoft / Stochastic-Mixture-of-Experts
This package implements THOR: Transformer with Stochastic Experts.
☆65Updated 3 years ago
androstj / tensor_rnn
An implementation of various tensor-based decomposition for NN & RNN parameters
☆18Updated 7 years ago
Noahs-ARK / PaLM
PyTorch implementation for PaLM: A Hybrid Parser and Language Model.
☆10Updated 5 years ago
libeineu / ODE-Transformer
This is a code repository for the ACL 2022 paper "ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generati…
☆34Updated 2 years ago
jemisjoky / umps_code
u-MPS implementation and experimentation code used in the paper Tensor Networks for Probabilistic Sequence Modeling (https://arxiv.org/ab…
☆19Updated 5 years ago
princeton-nlp / LM-Kernel-FT
A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643
☆78Updated last year
twinkle0331 / Xcompression
[ICLR 2022] Code for paper "Exploring Extreme Parameter Compression for Pre-trained Language Models"(https://arxiv.org/abs/2205.10036)
☆22Updated 2 years ago
robert-lieck / RBN
Recursive Bayesian Networks
☆11Updated 2 months ago
srush / mamba-scans
Blog post
☆17Updated last year
VITA-Group / BERT-Tickets
[NeurIPS 2020] "The Lottery Ticket Hypothesis for Pre-trained BERT Networks", Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Ya…
☆140Updated 3 years ago
khakhulin / compressed-transformer
Compression of NMT transformer model with tensor methods
☆48Updated 6 years ago
huggingface / block_movement_pruning
Block Sparse movement pruning
☆81Updated 4 years ago
zhuohan123 / macaron-net
Codes for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View"
☆148Updated 6 years ago
zhangjiong724 / spectral-RNN
STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION
☆16Updated 7 years ago
fuzihaofzh / repetition-problem-nlg
Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021.
☆54Updated 2 years ago
dodgejesse / show_your_work
☆11Updated 5 years ago
eamartin / parallelizing_linear_rnns
☆44Updated 7 years ago
KurochkinAlexey / AntisymmetricRNN
Python implementation of paper "AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks"
☆15Updated 6 years ago
belindal / TaskBench500
Suite of 500 procedurally-generated NLP tasks to study language model adaptability
☆21Updated 3 years ago
lancopku / Prime
A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.
☆85Updated 2 years ago
FranxYao / RDP
Implementation of ICML 22 Paper: Scaling Structured Inference with Randomization
☆14Updated 3 years ago
yaohungt / TransformerDissection
[EMNLP'19] Summary for Transformer Understanding
☆53Updated 5 years ago
Shark-NLP / CAB
☆31Updated 2 years ago
HazyResearch / structured-nets
Structured matrices for compressing neural networks
☆67Updated last year