The code for the Subformer, from the EMNLP 2021 Findings paper: "Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers", by Machel Reid, Edison Marrese-Taylor, and Yutaka Matsuo
☆16Sep 1, 2021Updated 4 years ago
Alternatives and similar repositories for subformer
Users who are interested in subformer are comparing it to the libraries listed below.
- Sentiment Analysis on Amazon Fine Food Reviews Data in Python☆12Jun 3, 2018Updated 7 years ago
- ☆12Feb 18, 2020Updated 6 years ago
- MLE-Guided Parameter Search (AAAI 2021)☆12Sep 16, 2021Updated 4 years ago
- EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints☆29Dec 21, 2021Updated 4 years ago
- Masking tokens to modify the predictions of a pretrained sentence classifier☆16Feb 4, 2020Updated 6 years ago
- Code for the paper "Efficient Adaption of Pretrained Transformers for Abstractive Summarization"☆70May 29, 2019Updated 6 years ago
- Automatically punctuate lecture transcripts obtained from YouTube.☆17Jun 12, 2020Updated 5 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Oct 18, 2019Updated 6 years ago
- Statistics on multilingual datasets☆17Jul 12, 2022Updated 3 years ago
- Predict edit intentions on Wikipedia☆19Jan 24, 2019Updated 7 years ago
- Vim theme inspired by Andromeda.☆16May 25, 2020Updated 5 years ago
- Economics of Ransomware | Dataset☆15May 2, 2018Updated 7 years ago
- ☆14Nov 23, 2020Updated 5 years ago
- ☆12Aug 4, 2018Updated 7 years ago
- ☆19Aug 1, 2020Updated 5 years ago
- Rethinking Perturbations in Encoder-Decoders for Fast Training☆18Nov 25, 2021Updated 4 years ago
- Implementation of A New Burrows Wheeler Transform Markov Distance☆12Apr 19, 2020Updated 5 years ago
- Data-Science-Projects-in-Python☆11Jul 25, 2018Updated 7 years ago
- ☆10Dec 30, 2020Updated 5 years ago
- Learning from Graphs: From Mathematical Principles to Practical Tools☆11Apr 16, 2021Updated 4 years ago
- Code for the AACL 2022 Paper "This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Cli…☆12Nov 18, 2022Updated 3 years ago
- ULMFiT + Siamese Network for Sentence Vectors☆33Oct 18, 2018Updated 7 years ago
- A python powered normalized compression distance (NCD) calculator.☆14Jan 26, 2016Updated 10 years ago
- PyTorch Implementation of "Learning Natural Language Inference with LSTM", 2016, S. Wang et al. (https://arxiv.org/pdf/1512.08849.pdf)☆19Dec 23, 2022Updated 3 years ago
- Enhancing AMR-to-Text Generation with Dual Graph Representations (implementation for the EMNLP-IJCNLP-2019 paper)☆22May 20, 2020Updated 5 years ago
- The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model".☆23Apr 23, 2021Updated 4 years ago
- ☆10Jun 16, 2022Updated 3 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval”☆12Jun 3, 2024Updated last year
- ☆11Feb 8, 2026Updated last month
- [ICLR 2025 SCI-FM Workshop] Lemur: Log Parsing with Entropy Sampling and Chain-of-Thought Merging☆14Mar 27, 2025Updated last year
- malicious bash scripts☆10Apr 3, 2022Updated 3 years ago
- Collection of Notebooks for Natural Language Processing with PyTorch☆31Jan 31, 2019Updated 7 years ago
- double array trie algorithm golang☆18Oct 28, 2014Updated 11 years ago
- "BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks"☆13May 10, 2024Updated last year
- A TensorFlow [2.0] implementation of ProSeNet: "Interpretable and Steerable Sequence Learning via Prototypes" (Ming et al., 2019)☆12Dec 19, 2019Updated 6 years ago
- ☆12Jun 21, 2022Updated 3 years ago
- AES file encryption and decryption☆10Apr 26, 2023Updated 2 years ago
- A text document will be provided and it'll produce its summary☆28May 25, 2020Updated 5 years ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated last year