The code for the Subformer, from the EMNLP 2021 Findings paper: "Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers", by Machel Reid, Edison Marrese-Taylor, and Yutaka Matsuo
☆16Sep 1, 2021Updated 4 years ago
Alternatives and similar repositories for subformer
Users that are interested in subformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11May 22, 2021Updated 5 years ago
- ☆17Mar 17, 2020Updated 6 years ago
- Team UWA's visualisation app developed as part of the ICDM 2019 Knowledge Graph Contest.☆12Dec 8, 2022Updated 3 years ago
- Explicit Alignment Objectives for Multilingual Bidirectional Encoders☆14Apr 14, 2021Updated 5 years ago
- ☆12Feb 18, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints☆29Dec 21, 2021Updated 4 years ago
- Masking tokens to modify the predictions of a pretrained sentence classifier☆16Feb 4, 2020Updated 6 years ago
- Code for the paper "Efficient Adaption of Pretrained Transformers for Abstractive Summarization"☆70May 29, 2019Updated 7 years ago
- [ICLR 2022] "Patch-Fool: Are Vision Transformers Always Robust Against Adversarial Perturbations?" by Yonggan Fu, Shunyao Zhang, Shang Wu…☆36Mar 16, 2022Updated 4 years ago
- reviese pyrouge files for supporting winxp win 8.1 win10☆12Nov 21, 2017Updated 8 years ago
- Mask Attention Networks: Rethinking and Strengthen Transformer in NAACL2021☆14Jun 3, 2021Updated 4 years ago
- Main repo to keep scripts, dockerfiles, wiki, etc☆15Mar 14, 2023Updated 3 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Oct 18, 2019Updated 6 years ago
- Statistics on multilingual datasets☆17Jul 12, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Vim theme inspired in andromeda.☆16May 25, 2020Updated 6 years ago
- Economics of Ransomware | Dataset☆15May 2, 2018Updated 8 years ago
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆23Apr 1, 2022Updated 4 years ago
- ☆12Aug 4, 2018Updated 7 years ago
- ☆14Nov 23, 2020Updated 5 years ago
- Rethinking Perturbations in Encoder-Decoders for Fast Training☆18Nov 25, 2021Updated 4 years ago
- Data-Science-Projects-in-Python☆11Jul 25, 2018Updated 7 years ago
- Training and testing pipeline for ransomware classification based on screenshots of the splash screens or ransom notes (https://arxiv.org…☆11Jul 19, 2020Updated 5 years ago
- ☆13Aug 31, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- TensorFlow 2.0 implementations of various autoencoders.☆16Apr 22, 2019Updated 7 years ago
- Code for the AACL 2022 Paper "This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Cli…☆12Nov 18, 2022Updated 3 years ago
- A python powered normalized compression distance (NCD) calculator.☆14Jan 26, 2016Updated 10 years ago
- A Python package for analysis of geometric morphometric data.☆10Jun 7, 2018Updated 7 years ago
- Enhancing AMR-to-Text Generation with Dual Graph Representations (implementation for the EMNLP-IJCNLP-2019 paper)☆22May 20, 2020Updated 6 years ago
- The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model".☆23Apr 23, 2021Updated 5 years ago
- Implementation for Aspect-Aware Latent Factor Model: Rating Prediction with Ratings and Reviews.☆25Sep 9, 2020Updated 5 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- ☆12Dec 30, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Collection of Notebooks for Natural Language Processing with PyTorch☆31Jan 31, 2019Updated 7 years ago
- A TensorFlow [2.0] implementation of ProSeNet: "Interpretable and Steerable Sequence Learning via Prototypes" (Ming et al., 2019)☆13Dec 19, 2019Updated 6 years ago
- ☆12Jun 21, 2022Updated 3 years ago
- 使用卷积神经网络识别恶意软件,其特点是把文件的每个字节都当做输入☆16Oct 14, 2024Updated last year
- A text document will be provided and it'll produce it's summary☆28May 25, 2020Updated 6 years ago
- A Correlated Topic Model implementation in Python.☆33Apr 24, 2020Updated 6 years ago
- ☆24Feb 15, 2020Updated 6 years ago