gucci-j / light-transformer-emnlp2021View external linksLinks
EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling
☆34Nov 21, 2021Updated 4 years ago
Alternatives and similar repositories for light-transformer-emnlp2021
Users that are interested in light-transformer-emnlp2021 are comparing it to the libraries listed below
Sorting:
- ☆18Nov 25, 2022Updated 3 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆94Jun 8, 2022Updated 3 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization"☆68Jul 4, 2021Updated 4 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- Official code of our work, Syntax-augmented Multilingual BERT for Cross-lingual Transfer [ACL 2021].☆16Dec 2, 2021Updated 4 years ago
- Guide for the slp group on how to use the Grnet cluster☆11Apr 16, 2020Updated 5 years ago
- [ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning☆44May 10, 2023Updated 2 years ago
- code for paper Revisiting the Negative Data of Distantly Supervised Relation Extraction☆20Feb 22, 2022Updated 3 years ago
- Code for evaluating uncertainty estimation methods for Transformer-based architectures in natural language understanding tasks.☆43Aug 16, 2021Updated 4 years ago
- ☆24Jun 12, 2023Updated 2 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- ☆10Sep 27, 2021Updated 4 years ago
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Mar 30, 2023Updated 2 years ago
- IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization☆12Nov 23, 2021Updated 4 years ago
- Code for "SEE-Few: Seed, Expand and Entail for Few-shot Named Entity Recognition", accepted at COLING 2022.☆12Nov 25, 2022Updated 3 years ago
- [Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fi…☆12Sep 17, 2024Updated last year
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》☆25Dec 15, 2021Updated 4 years ago
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃☆116Oct 27, 2022Updated 3 years ago
- Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020☆14Oct 6, 2020Updated 5 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)☆11Feb 15, 2024Updated 2 years ago
- ☆14Aug 3, 2022Updated 3 years ago
- ☆17May 31, 2023Updated 2 years ago
- Code and models for the paper titled "Better Feature Integration for Named Entity Recognition", NAACL 2021.☆30Nov 5, 2021Updated 4 years ago
- ☆31May 26, 2021Updated 4 years ago
- [EMNLP 2022] Summarization as Indirect Supervision for Relation Extraction (SuRE)☆28Nov 22, 2022Updated 3 years ago
- Source code for "UniRE: A Unified Label Space for Entity Relation Extraction.", ACL2021. It is based on our NERE toolkit (https://github.…☆122Apr 13, 2022Updated 3 years ago
- 📜 Codes and Data for COLING2020 paper: Towards Accurate and Consistent Evaluation: A Dataset for Distantly-Supervised Relation Extractio…☆31Feb 2, 2021Updated 5 years ago
- The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021☆36May 8, 2021Updated 4 years ago
- codes for ACL2021 paper "SENT: Sentence-level Distant Relation Extraction via Negative Training"☆29Dec 17, 2021Updated 4 years ago
- ☆13Apr 16, 2021Updated 4 years ago
- PathPiece tokenizer☆13Nov 10, 2024Updated last year
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- ☆16Jun 12, 2023Updated 2 years ago
- State of What Art? A Call for Multi-Prompt LLM Evaluation☆15Jul 10, 2024Updated last year
- Explicit Alignment Objectives for Multilingual Bidirectional Encoders☆14Apr 14, 2021Updated 4 years ago
- Website for HKU NLP group (under construction)☆14Dec 23, 2025Updated last month
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 2 months ago