tm4roon / data-augmentation-for-nlp
An implementation of data augmentation methods for natural language processing tasks.
☆13 · Updated 3 months ago
Related projects
Alternatives and complementary repositories for data-augmentation-for-nlp
- Codebase for probing and visualizing multilingual models. ☆45 · Updated 4 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly. ☆46 · Updated last year
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant. ☆31 · Updated 4 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform ☆17 · Updated 4 years ago
- ☆43 · Updated 5 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models. ☆52 · Updated 4 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest… ☆30 · Updated 4 years ago
- ☆15 · Updated 2 years ago
- This is the repository for the Interspeech 2018 paper "Coherence models for dialogue". ☆19 · Updated 4 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 · Updated 3 years ago
- The official implementation of ACL 2020, "Logic-Guided Data Augmentation and Regularization for Consistent Question Answering". ☆72 · Updated 3 months ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation" ☆39 · Updated 5 years ago
- Repository for KPTimes corpus ☆34 · Updated 2 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations". ☆63 · Updated 4 years ago
- SUM-QE, a BERT-based Summary Quality Estimation Model ☆21 · Updated last year
- GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019) ☆92 · Updated 2 years ago
- ☆33 · Updated 6 years ago
- Efficient Sentence Embedding via Semantic Subspace Analysis ☆14 · Updated 4 years ago
- ☆46 · Updated 4 years ago
- Pytorch Seq2Seq framework ☆26 · Updated 3 weeks ago
- [ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling ☆71 · Updated last year
- Code for our EACL-2021 paper "Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs". ☆38 · Updated 4 months ago
- Source code of the paper "Do Syntax Trees Help Pre-trained Transformers Extract Information?" (EACL 2021) ☆72 · Updated 2 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021 ☆33 · Updated 3 years ago
- ☆32 · Updated 3 years ago
- Code, data, and additional analysis for the paper Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evalua… ☆15 · Updated 4 years ago
- Lexically Constrained Neural Machine Translation with Levenshtein Transformer ☆39 · Updated 4 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging ☆35 · Updated 4 years ago
- A Cross-Domain Transferable Neural Coherence Model https://arxiv.org/abs/1905.11912 ☆24 · Updated 4 years ago
- Accompanying repository of our AAAI-20 paper "Fine-Grained Argument Unit Recognition and Classification." ☆20 · Updated 4 years ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework ☆52 · Updated 4 years ago