The code for the Subformer, from the EMNLP 2021 Findings paper: "Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers", by Machel Reid, Edison Marrese-Taylor, and Yutaka Matsuo
☆16Sep 1, 2021Updated 4 years ago
Alternatives and similar repositories for subformer
Users that are interested in subformer are comparing it to the libraries listed below.
- ☆13Jun 11, 2021Updated 4 years ago
- ☆11May 22, 2021Updated 4 years ago
- an implementation of CryptoNets☆11Aug 22, 2018Updated 7 years ago
- ☆17Mar 17, 2020Updated 6 years ago
- Team UWA's visualisation app developed as part of the ICDM 2019 Knowledge Graph Contest.☆13Dec 8, 2022Updated 3 years ago
- MLE-Guided Parameter Search (AAAI 2021)☆12Sep 16, 2021Updated 4 years ago
- EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints☆29Dec 21, 2021Updated 4 years ago
- Masking tokens to modify the predictions of a pretrained sentence classifier☆16Feb 4, 2020Updated 6 years ago
- Code for the paper "Efficient Adaption of Pretrained Transformers for Abstractive Summarization"☆70May 29, 2019Updated 6 years ago
- Revised pyrouge files to support Windows XP, Windows 8.1, and Windows 10☆12Nov 21, 2017Updated 8 years ago
- Automatically punctuate lecture transcripts obtained from YouTube.☆17Jun 12, 2020Updated 5 years ago
- Design Statistical Models on OpenClassrooms☆13Jul 18, 2019Updated 6 years ago
- Homomorphic comparison in leveled homomorphic encryption and its applications☆39Jun 25, 2021Updated 4 years ago
- Statistics on multilingual datasets☆17Jul 12, 2022Updated 3 years ago
- Predict edit intentions on Wikipedia☆19Jan 24, 2019Updated 7 years ago
- ☆19Aug 3, 2025Updated 7 months ago
- ☆19Aug 1, 2020Updated 5 years ago
- Rethinking Perturbations in Encoder-Decoders for Fast Training☆18Nov 25, 2021Updated 4 years ago
- Implementation of A New Burrows Wheeler Transform Markov Distance☆12Apr 19, 2020Updated 5 years ago
- Data-Science-Projects-in-Python☆11Jul 25, 2018Updated 7 years ago
- ☆47Aug 21, 2024Updated last year
- Learning from Graphs: From Mathematical Principles to Practical Tools☆11Apr 16, 2021Updated 4 years ago
- A python powered normalized compression distance (NCD) calculator.☆14Jan 26, 2016Updated 10 years ago
- PyTorch Implementation of "Learning Natural Language Inference with LSTM", 2016, S. Wang et al. (https://arxiv.org/pdf/1512.08849.pdf)☆19Dec 23, 2022Updated 3 years ago
- Enhancing AMR-to-Text Generation with Dual Graph Representations (implementation for the EMNLP-IJCNLP-2019 paper)☆22May 20, 2020Updated 5 years ago
- The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model".☆23Apr 23, 2021Updated 4 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval”☆12Jun 3, 2024Updated last year
- ☆11Feb 8, 2026Updated last month
- [ICLR 2025 SCI-FM Workshop] Lemur: Log Parsing with Entropy Sampling and Chain-of-Thought Merging☆14Mar 27, 2025Updated last year
- Double-array trie algorithm in Go☆18Oct 28, 2014Updated 11 years ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆94Jan 24, 2024Updated 2 years ago
- A collection of papers related to speech model compression☆26Jul 31, 2023Updated 2 years ago
- AES file encryption and decryption☆10Apr 26, 2023Updated 2 years ago
- Malware detection with a convolutional neural network that takes every byte of the file as input☆16Oct 14, 2024Updated last year
- FEM micromagnetic simulator☆11Mar 19, 2026Updated last week
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated last year
- A Correlated Topic Model implementation in Python.☆33Apr 24, 2020Updated 5 years ago
- Hands-On Data Science with R, published by Packt☆23Jan 30, 2023Updated 3 years ago
- ☆28Oct 6, 2020Updated 5 years ago