EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling
☆34Nov 21, 2021Updated 4 years ago
Alternatives and similar repositories for light-transformer-emnlp2021
Users that are interested in light-transformer-emnlp2021 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Nov 25, 2022Updated 3 years ago
- Guide for the slp group on how to use the Grnet cluster☆11Apr 16, 2020Updated 5 years ago
- Code for evaluating uncertainty estimation methods for Transformer-based architectures in natural language understanding tasks.☆44Aug 16, 2021Updated 4 years ago
- Codebase, data and models for hallucination of pruned models☆16Jan 11, 2025Updated last year
- German Alpaca Dataset (Cleaned + Translated)☆26Apr 6, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Source code for the ACL 2019 paper "Attention-based Conditioning Methods for External Knowledge Integration"☆59Jun 21, 2022Updated 3 years ago
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Mar 30, 2023Updated 3 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- ☆17May 25, 2020Updated 5 years ago
- ☆14Aug 3, 2022Updated 3 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆23May 1, 2022Updated 3 years ago
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization☆12Nov 23, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Explicit Alignment Objectives for Multilingual Bidirectional Encoders☆14Apr 14, 2021Updated 4 years ago
- Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020☆14Oct 6, 2020Updated 5 years ago
- Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)☆11Feb 15, 2024Updated 2 years ago
- Official code of our work, Syntax-augmented Multilingual BERT for Cross-lingual Transfer [ACL 2021].☆16Dec 2, 2021Updated 4 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simp…☆18Mar 30, 2022Updated 4 years ago
- Code for the EMNLP 2021 Paper "Active Learning by Acquiring Contrastive Examples" & the ACL 2022 Paper "On the Importance of Effectively …☆128May 24, 2022Updated 3 years ago
- ☆24Jun 12, 2023Updated 2 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"☆131Apr 23, 2022Updated 3 years ago
- ☆16Jun 12, 2023Updated 2 years ago
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃☆117Oct 27, 2022Updated 3 years ago
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago
- Code for the paper "True Few-Shot Learning in Language Models" (https://arxiv.org/abs/2105.11447)☆143Oct 25, 2021Updated 4 years ago
- [ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning☆44May 10, 2023Updated 2 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 4 years ago
- [Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fi…☆12Sep 17, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆14Feb 3, 2021Updated 5 years ago
- X2Static embeddings☆15Jul 15, 2021Updated 4 years ago
- ☆13Apr 16, 2021Updated 4 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- State of What Art? A Call for Multi-Prompt LLM Evaluation☆15Jul 10, 2024Updated last year
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago