szhangtju/The-compression-of-Transformer

☆65

Alternatives and similar repositories for The-compression-of-Transformer

Users that are interested in The-compression-of-Transformer are comparing it to the libraries listed below.

khakhulin / compressed-transformer
Compression of NMT transformer model with tensor methods
☆49 · Jun 7, 2019 · Updated 7 years ago
tt-embedding / tt-embeddings
☆28 · Oct 21, 2019 · Updated 6 years ago
arkmagus / tensor_rnn
An implementation of various tensor-based decomposition for NN & RNN parameters
☆20 · Jun 3, 2018 · Updated 8 years ago
chenjoya / dropit
DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training (ICLR 2023)
☆32 · Apr 8, 2023 · Updated 3 years ago
lljbash / FastTT
Performs a faster tensor train (TT) decomposition for large sparse data
☆14 · Sep 7, 2020 · Updated 5 years ago
tatsu-lab / mlm_inductive_bias
Code Release for "On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies"
☆16 · Apr 13, 2021 · Updated 5 years ago
Tuyki / TT_RNN
☆103 · Mar 2, 2018 · Updated 8 years ago
Andrew-Tierno / QuantizedTransformer
Implementation of a Quantized Transformer Model
☆20 · Mar 20, 2019 · Updated 7 years ago
whyNLP / Probabilistic-Transformer
A probabilistic model for contextual word representation. Accepted to ACL 2023 Findings.
☆26 · Oct 22, 2023 · Updated 2 years ago
xwcao / TGAN
Code for the paper "Tensorizing Generative Adversarial Nets"
☆15 · Mar 22, 2018 · Updated 8 years ago
colehawkins / bayesian-tensor-rank-determination
☆13 · Dec 17, 2021 · Updated 4 years ago
viking-sudo-rm / industrial-stacknns
Stack neural networks applied to hefty natural language tasks.
☆15 · Dec 26, 2019 · Updated 6 years ago
sanagno / adaptively_sparse_attention
☆24 · Jul 7, 2023 · Updated 3 years ago
FranxYao / RDP
Implementation of ICML 22 Paper: Scaling Structured Inference with Randomization
☆13 · Jul 24, 2022 · Updated 4 years ago
LiUzHiAn / cv-utils
Basic image/video processing utilities in Python.
☆12 · Apr 19, 2021 · Updated 5 years ago
KhrulkovV / tt-pytorch
☆59 · Jul 6, 2020 · Updated 6 years ago
jemisjoky / umps_code
u-MPS implementation and experimentation code used in the paper Tensor Networks for Probabilistic Sequence Modeling (https://arxiv.org/ab…
☆19 · Jul 2, 2020 · Updated 6 years ago
onucharles / tensorized-rnn
A fully tensorized recurrent neural network using tensor-train decomposition
☆26 · Dec 13, 2022 · Updated 3 years ago
bigganbing / Fairseq_MorphTE
[NeurIPS 2022] MorphTE: Injecting Morphology in Tensorized Embeddings
☆17 · Oct 29, 2022 · Updated 3 years ago
aistairc / rnng-pytorch
☆23 · Jul 23, 2021 · Updated 5 years ago
AndPotap / einsum-search
☆34 · Oct 4, 2024 · Updated last year
androstj / tensor_rnn
An implementation of various tensor-based decomposition for NN & RNN parameters
☆18 · Jun 4, 2018 · Updated 8 years ago
philip-bl / tensorizing_neural_networks-novikov_2015
MNIST experiment from Tensorizing neural networks (Novikov et al. 2015)
☆14 · Oct 22, 2019 · Updated 6 years ago
da03 / criticize_text_generation
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆12 · Mar 18, 2023 · Updated 3 years ago
jenni-ai / T2FW
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20 · Oct 9, 2022 · Updated 3 years ago
Guan-t7 / myTLAE
Community Implementation of *Temporal Latent Auto-Encoder* as described in [Temporal Latent Auto-Encoder: A Method for Probabilistic Mult…
☆15 · Jun 9, 2022 · Updated 4 years ago
Runjing-Liu120 / RaoBlackwellizedSGD
A public repository for our paper, Rao-Blackwellized Stochastic Gradients for Discrete Distributions
☆22 · May 5, 2019 · Updated 7 years ago
Bihaqo / TensorNet
☆141 · Nov 24, 2017 · Updated 8 years ago
robert-lieck / RBN
Recursive Bayesian Networks
☆11 · May 11, 2025 · Updated last year
Noahs-ARK / PaLM
PyTorch implementation for PaLM: A Hybrid Parser and Language Model.
☆10 · Jan 7, 2020 · Updated 6 years ago
minhtannguyen / transformer-mgk
This is the public GitHub for our paper "Transformer with a Mixture of Gaussian Keys"
☆28 · Aug 13, 2022 · Updated 3 years ago
nikitakit / tetra-tagging
Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference
☆14 · Jul 6, 2020 · Updated 6 years ago
yantijin / ScoreGradPred
Code for *ScoreGrad: Multivariate Probabilistic Time Series Forecasting with Continuous Energy-based Generative Models*
☆89 · Feb 4, 2025 · Updated last year
YangletLiu / Tensor_Layer_for_Deep_Neural_Network_Compression
Apply CP, Tucker, TT/TR, HT to compress neural networks. Train from scratch.
☆17 · Nov 26, 2020 · Updated 5 years ago
timvieira / vocrf
Variable-order CRFs with structure learning
☆17 · Aug 1, 2024 · Updated last year
iesl / s-diora
☆12 · Jan 29, 2021 · Updated 5 years ago
deep-spin / sparse-communication
☆12 · Mar 7, 2022 · Updated 4 years ago
yifanycc / loretta
[NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models
☆39 · Jan 9, 2025 · Updated last year
VPeterV / RankSpace-Models
Source code for the NAACL 2022 main conference paper "Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs"
☆10 · Sep 26, 2022 · Updated 3 years ago