A PyTorch implementation of the paper - "Synthesizer: Rethinking Self-Attention in Transformer Models"
☆74Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for Synthesizer
Users that are interested in Synthesizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch☆71May 28, 2020Updated 5 years ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆11May 27, 2022Updated 3 years ago
- Code for ACL 2022 paper "HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization".☆13May 24, 2022Updated 3 years ago
- Implementation of RealFormer using pytorch☆101Dec 27, 2020Updated 5 years ago
- Convolutional Fine-Grained Classification with Self-Supervised Target Relation Regularization (TIP 2022)☆12Sep 8, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Visual Transformers with Primal Object Queries for Multi-Label Image Classification☆12May 17, 2022Updated 3 years ago
- M3TR: Multi-modal Multi-label Recognition with Transformer. ACM MM 2021☆16Oct 27, 2021Updated 4 years ago
- KoBART chatbot☆45Jun 22, 2021Updated 4 years ago
- The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonatha…☆69Sep 19, 2021Updated 4 years ago
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- Code for Multi-Head Attention: Collaborate Instead of Concatenate☆152Jun 12, 2023Updated 2 years ago
- ☆17Oct 19, 2021Updated 4 years ago
- 문서 요약 논문 정리☆15Oct 27, 2021Updated 4 years ago
- ☆18Apr 11, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Modular Graph Transformer Networks☆23Apr 24, 2021Updated 4 years ago
- LM pretraining for generation, reading list, resources, conference mappings.☆20Feb 25, 2020Updated 6 years ago
- ☆25Jul 15, 2023Updated 2 years ago
- annotated-transformer-kr☆15May 16, 2019Updated 6 years ago
- DeLighT: Very Deep and Light-Weight Transformers☆469Oct 16, 2020Updated 5 years ago
- BERT baselines for extractive question answering on coqa (https://stanfordnlp.github.io/coqa/)☆10Jan 27, 2020Updated 6 years ago
- Poly-encoder architecture and pre-training pipeline implementation (pytorch)☆16Jun 29, 2020Updated 5 years ago
- NeurIPS 2019 Paper Implementation☆12Nov 22, 2022Updated 3 years ago
- KoGPT2 on Huggingface Transformers☆33May 4, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆20Nov 11, 2019Updated 6 years ago
- The code of ACL 2020 paper "You Impress Me: Dialogue Generation via Mutual Persona Perception"☆307Oct 27, 2023Updated 2 years ago
- Multi-Turn-Single-Intent Bert model for dialogue session classification☆25Dec 8, 2022Updated 3 years ago
- Train your own GPT2!☆14Apr 11, 2023Updated 2 years ago
- ☆24Nov 22, 2022Updated 3 years ago
- The implementation of paper ''Efficient Attention Network: Accelerate Attention by Searching Where to Plug''.☆20Jun 16, 2023Updated 2 years ago
- Question Answering In Context☆28Nov 24, 2022Updated 3 years ago
- ☆13Jul 13, 2022Updated 3 years ago
- PyTorch implementation of Boosting Multi-Label Image Classification with Complementary Parallel Self-Distillation, IJCAI 2022.☆26Aug 25, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- High performance pytorch modules☆17Jan 14, 2023Updated 3 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Jul 6, 2023Updated 2 years ago
- ☆27Nov 5, 2022Updated 3 years ago
- ☆20Sep 7, 2019Updated 6 years ago
- This repository provides simulator codes for predicting and tracking popular discussion threads on Reddit☆20Sep 10, 2016Updated 9 years ago
- Korean text data preprocess toolkit for NLP☆18Jun 11, 2019Updated 6 years ago
- statnlp-neural☆32Sep 26, 2019Updated 6 years ago