clovaai / length-adaptive-transformerView external linksLinks
Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)
☆102Nov 2, 2020Updated 5 years ago
Alternatives and similar repositories for length-adaptive-transformer
Users that are interested in length-adaptive-transformer are comparing it to the libraries listed below
Sorting:
- Method to improve inference time for BERT. This is an implementation of the paper titled "PoWER-BERT: Accelerating BERT Inference via Pro…☆62Sep 17, 2025Updated 4 months ago
- Subword Language Model for Query Auto-Completion☆67Sep 5, 2019Updated 6 years ago
- SOM-DST: Efficient Dialogue State Tracking by Selectively Overwriting Memory (ACL 2020)☆153Jul 25, 2024Updated last year
- Neural Text Generation with Unlikelihood Training☆310Aug 31, 2021Updated 4 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- Source code for "Efficient Training of BERT by Progressively Stacking"☆113Jul 3, 2019Updated 6 years ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆470Jun 22, 2022Updated 3 years ago
- MLPs for Vision and Langauge Modeling (Coming Soon)☆27Dec 9, 2021Updated 4 years ago
- ☆14May 3, 2022Updated 3 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 3 years ago
- Korean Named Entity Corpus☆25May 12, 2023Updated 2 years ago
- 국내 자연어 처리 기술을 연구 및 개발하는 스타트업 목록☆165May 10, 2020Updated 5 years ago
- Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer (NeurIPS 2021))☆56Feb 6, 2023Updated 3 years ago
- Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3☆23May 20, 2021Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- ☆221Jun 8, 2020Updated 5 years ago
- [NAACL 2021] Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering☆36Apr 20, 2021Updated 4 years ago
- Phrase-Indexed Question Answering (PIQA)☆94Apr 27, 2019Updated 6 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer☆39Sep 22, 2020Updated 5 years ago
- FaVIQ: Fact Verification from Information-seeking Questions☆43Nov 23, 2022Updated 3 years ago
- Multi-hop dense retrieval for question answering☆218Oct 12, 2021Updated 4 years ago
- 문장단위로 분절된 한국어 위키피디아 코퍼스. Releases에서 다운로드 받거나 tfds-korean으로 사용해주세요.☆24Sep 6, 2023Updated 2 years ago
- ☆25Oct 28, 2020Updated 5 years ago
- ☆62Apr 19, 2022Updated 3 years ago
- Korean Nested Named Entity Corpus☆20May 13, 2023Updated 2 years ago
- LM Pretraining with PyTorch/TPU☆137Oct 24, 2019Updated 6 years ago
- Understanding the Difficulty of Training Transformers☆332May 31, 2022Updated 3 years ago
- PyTorch code for EMNLP 2020 Paper "Vokenization: Improving Language Understanding with Visual Supervision"☆192Mar 8, 2021Updated 4 years ago
- https://ailabs.enliple.com/☆105Feb 25, 2021Updated 4 years ago
- Training Transformers of Huggingface with KoNLPy☆68Aug 28, 2020Updated 5 years ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Jan 16, 2022Updated 4 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆33Jun 14, 2023Updated 2 years ago
- Stochastic Optimization for Global Contrastive Learning without Large Mini-batches☆20Mar 31, 2023Updated 2 years ago
- 한국어 어휘 의미 분석 모델☆21Apr 4, 2022Updated 3 years ago
- High performance pytorch modules☆18Jan 14, 2023Updated 3 years ago
- Do Neural Language Representations Learn Physical Commonsense?☆22Dec 28, 2021Updated 4 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Korean text data preprocess toolkit for NLP☆18Jun 11, 2019Updated 6 years ago
- Longformer: The Long-Document Transformer☆2,186Feb 8, 2023Updated 3 years ago