The code for the Subformer, from the EMNLP 2021 Findings paper: "Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers", by Machel Reid, Edison Marrese-Taylor, and Yutaka Matsuo
☆16Sep 1, 2021Updated 4 years ago
Alternatives and similar repositories for subformer
Users that are interested in subformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11May 22, 2021Updated 5 years ago
- Team UWA's visualisation app developed as part of the ICDM 2019 Knowledge Graph Contest.☆12Dec 8, 2022Updated 3 years ago
- Explicit Alignment Objectives for Multilingual Bidirectional Encoders☆14Apr 14, 2021Updated 5 years ago
- ☆12Feb 18, 2020Updated 6 years ago
- MLE-Guided Parameter Search (AAAI 2021)☆12Sep 16, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- EDITOR: an Edit-Based Transformer with Repositioning for Neural Machine Translation with Soft Lexical Constraints☆29Dec 21, 2021Updated 4 years ago
- Code for the paper "Efficient Adaption of Pretrained Transformers for Abstractive Summarization"☆70May 29, 2019Updated 7 years ago
- ☆16Oct 5, 2022Updated 3 years ago
- reviese pyrouge files for supporting winxp win 8.1 win10☆12Nov 21, 2017Updated 8 years ago
- [NeurIPS 2024] BLAST: Block Level Adaptive Structured Matrix for Efficient Deep Neural Network Inference☆18Nov 6, 2024Updated last year
- Vim theme inspired in andromeda.☆16May 25, 2020Updated 6 years ago
- ☆19Aug 3, 2025Updated 10 months ago
- Economics of Ransomware | Dataset☆15May 2, 2018Updated 8 years ago
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆23Apr 1, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Aug 4, 2018Updated 7 years ago
- Implementation for paper "Deep Recurrent Generative Decoder for Abstractive Text Summarization" https://arxiv.org/pdf/1708.00625.pdf☆20Mar 25, 2019Updated 7 years ago
- Rethinking Perturbations in Encoder-Decoders for Fast Training☆18Nov 25, 2021Updated 4 years ago
- Implementation of A New Burrows Wheeler Transform Markov Distance☆12Apr 19, 2020Updated 6 years ago
- Data-Science-Projects-in-Python☆11Jul 25, 2018Updated 7 years ago
- Learn symforce together :)☆32Aug 23, 2022Updated 3 years ago
- Learning from Graphs: From Mathematical Principles to Practical Tools☆11Apr 16, 2021Updated 5 years ago
- ☆13Aug 31, 2024Updated last year
- TensorFlow 2.0 implementations of various autoencoders.☆16Apr 22, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for the AACL 2022 Paper "This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Cli…☆12Nov 18, 2022Updated 3 years ago
- RadSimReal: Bridging the Gap Between Synthetic and Real Data in Radar Object Detection With Simulation (CVPR 2024)☆22Oct 8, 2025Updated 8 months ago
- PyTorch Implementation of "Learning Natural Language Inference with LSTM", 2016, S. Wang et al. (https://arxiv.org/pdf/1512.08849.pdf)☆19Dec 23, 2022Updated 3 years ago
- The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model".☆23Apr 23, 2021Updated 5 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated 2 years ago
- ☆12Dec 30, 2020Updated 5 years ago
- double array trie algorithm golang☆18Oct 28, 2014Updated 11 years ago
- ☆16Dec 21, 2023Updated 2 years ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆95Jan 24, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A TensorFlow [2.0] implementation of ProSeNet: "Interpretable and Steerable Sequence Learning via Prototypes" (Ming et al., 2019)☆13Dec 19, 2019Updated 6 years ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago
- Hands-On Data Science with R, published by Packt☆23Jan 30, 2023Updated 3 years ago
- A neural model for knowledge graph construction from text.☆21Dec 8, 2022Updated 3 years ago
- ☆24Feb 15, 2020Updated 6 years ago
- Corresponding code to "Improving Robustness of ML Classifiers against Realizable Evasion Attacks Using Conserved Features" @ USENIX Secur…☆11Aug 5, 2019Updated 6 years ago
- ☆28Oct 6, 2020Updated 5 years ago