Compression of NMT transformer model with tensor methods
☆49 Jun 7, 2019 Updated 6 years ago
Alternatives and similar repositories for compressed-transformer
Users that are interested in compressed-transformer are comparing it to the libraries listed below.
- ☆59 Jul 6, 2020 Updated 5 years ago
- [NeurIPS 2022] MorphTE: Injecting Morphology in Tensorized Embeddings ☆17 Oct 29, 2022 Updated 3 years ago
- Implementation of a Quantized Transformer Model ☆20 Mar 20, 2019 Updated 7 years ago
- An implementation of various tensor-based decomposition for NN & RNN parameters ☆20 Jun 3, 2018 Updated 7 years ago
- ☆28 Oct 21, 2019 Updated 6 years ago
- Tensor Train decomposition on TensorFlow ☆227 Apr 6, 2021 Updated 5 years ago
- ☆13 Dec 17, 2021 Updated 4 years ago
- Tools for using AD to optimize tensor networks, built on top of ITensor and AutoHOOT. ☆14 Sep 23, 2022 Updated 3 years ago
- ☆24 Oct 21, 2024 Updated last year
- Anonymized supplementary code for NeurIPS submission ☆28 May 22, 2019 Updated 6 years ago
- ☆103 Mar 2, 2018 Updated 8 years ago
- Deep convolutional tensor network ☆11 Sep 29, 2020 Updated 5 years ago
- ☆14 Nov 16, 2022 Updated 3 years ago
- Replace the fully connected layers of FC2, LeNet-5, VGG, ResNet, and DenseNet with MPO ☆33 Sep 17, 2019 Updated 6 years ago
- ICLR 2019, Multilingual Neural Machine Translation with Knowledge Distillation ☆70 Oct 7, 2020 Updated 5 years ago
- code of low-rank tensor train for tensor robust principal component analysis ☆15 Feb 18, 2020 Updated 6 years ago
- Notebook for illustrating the use of CMPSKit.jl and to reproduce the results from arXiv:2006.01801. ☆13 Nov 9, 2021 Updated 4 years ago
- QCHS2022 Summer School Tutorial for Yao and Bloqade ☆13 Jan 13, 2023 Updated 3 years ago
- lecture materials of the ML for Physics course 2021 in Perimeter Institute ☆21 Mar 31, 2021 Updated 5 years ago
- End-to-end Speech Translation with Stacked Acoustic-and-Textual Encoding ☆26 Aug 12, 2021 Updated 4 years ago
- ☆14 Dec 12, 2018 Updated 7 years ago
- The Anole Programming Language ☆17 Nov 13, 2022 Updated 3 years ago
- Julia library for quantum circuit simulation using tensor networks ☆16 Jan 7, 2020 Updated 6 years ago
- Translate - a PyTorch Language Library ☆10 Mar 14, 2019 Updated 7 years ago
- MUSCO: MUlti-Stage COmpression of neural networks ☆72 Feb 16, 2021 Updated 5 years ago
- Code for "ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer" ☆16 Jul 17, 2024 Updated last year
- EMNLP 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer ☆13 Mar 23, 2025 Updated last year
- ☆19 Sep 15, 2021 Updated 4 years ago
- Minimum viable code for the Decodable Information Bottleneck paper. Pytorch Implementation. ☆11 Oct 20, 2020 Updated 5 years ago
- Self-explaining Matrix Product States library in Python ☆22 Sep 2, 2023 Updated 2 years ago
- Learning to Hash for Maximum Inner Product Search ☆12 Jan 21, 2022 Updated 4 years ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer" ☆12 Jun 7, 2021 Updated 4 years ago
- The project page of paper: Aha! Adaptive History-driven Attack for Decision-based Black-box Models [ICCV 2021] ☆10 Feb 23, 2022 Updated 4 years ago
- MAsked Sequence to Sequence (MASS) pre-training for language generation ☆20 Mar 18, 2019 Updated 7 years ago
- Finding Generalizable Evidence by Learning to Convince Q&A Models ☆25 Jan 5, 2023 Updated 3 years ago
- Modern Gaussian Processes: Scalable Inference and Novel Applications ☆20 Jul 13, 2019 Updated 6 years ago
- We implement an efficient mechanism for compressing large networks by tensorizing network layers: i.e. mapping layers on to high-… ☆11 Jul 10, 2018 Updated 7 years ago
- this repository is created to accumulate all LaTeX templates needed at Skoltech ☆20 Nov 27, 2018 Updated 7 years ago
- Adversarial Training for Natural Language Understanding ☆252 Sep 6, 2023 Updated 2 years ago