helmy-elrais / RoBERT_Recurrence_over_BERT
pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch implementation of other Hierarchical Methods (Mean Pooling and Max Pooling) and Truncation Methods (Head Only and Tail Only) presented in this paper https://arxiv.org/abs/1905.05583
☆80Updated 2 years ago
Alternatives and similar repositories for RoBERT_Recurrence_over_BERT:
Users that are interested in RoBERT_Recurrence_over_BERT are comparing it to the libraries listed below
- ☆85Updated 3 years ago
- https://arxiv.org/pdf/1909.04054☆78Updated 2 years ago
- ☆40Updated 3 years ago
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni…☆80Updated 2 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆133Updated 11 months ago
- ☆57Updated 2 years ago
- Lexical Simplification with Pretrained Encoders☆70Updated 3 years ago
- A repo to explore different NLP tasks which can be solved using T5☆170Updated 3 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆201Updated 2 years ago
- Benchmarking various Deep Learning models such as BERT, ALBERT, BiLSTMs on the task of sentence entailment using two datasets - MultiNLI …☆28Updated 4 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆132Updated last year
- Master thesis with code investigating methods for incorporating long-context reasoning in low-resource languages, without the need to pre…☆32Updated 3 years ago
- Fine-tune transformers with pytorch-lightning☆44Updated 2 years ago
- Codebase for the Medium Article on Fine-tuning GPT2 for Text Generation☆69Updated 4 years ago
- This shows how to fine-tune Bert language model and use PyTorch-transformers for text classififcation☆70Updated 4 years ago
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approa…☆91Updated 2 years ago
- This is the official code for Extractive Summarization of Long Documents by Combining Global and Local Context☆68Updated 4 years ago
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- ☆77Updated 8 months ago
- ☆42Updated 4 years ago
- Named Entity Recognition with Pretrained XLM-RoBERTa☆87Updated 3 years ago
- ☆120Updated 4 years ago
- Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper☆107Updated 4 years ago
- Using BERT for long sentence classification (more than 512 word pieces).☆17Updated 3 years ago
- ☆59Updated 3 years ago
- ☆43Updated 2 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆92Updated 2 years ago
- LongSumm - Scientific Document Summarization Task☆74Updated 2 years ago
- GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019)☆93Updated 2 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆85Updated 3 years ago