lanwuwei / GigaBERT
Zero-shot Transfer Learning from English to Arabic
☆30 · Updated 3 years ago
Alternatives and similar repositories for GigaBERT
Users who are interested in GigaBERT are comparing it to the repositories listed below:
- Code and models used in "MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases". ☆98 · Updated 2 years ago
- This repository contains the Arabic sarcasm dataset (ArSarcasm). ☆24 · Updated 4 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE) ☆152 · Updated 4 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly. ☆47 · Updated 2 years ago
- GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019) ☆95 · Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish. ☆59 · Updated 2 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation ☆94 · Updated 2 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow. ☆92 · Updated 4 months ago
- Lexical Simplification with Pretrained Encoders ☆70 · Updated 4 years ago
- QED: A Framework and Dataset for Explanations in Question Answering ☆117 · Updated 3 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations". ☆64 · Updated 4 years ago
- Transformer based translation quality estimation ☆112 · Updated last year
- OpusFilter - Parallel corpus processing toolkit ☆105 · Updated last week
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper ☆147 · Updated 2 months ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch. ☆36 · Updated 2 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing ☆101 · Updated 11 months ago
- Dual Encoders for State-of-the-art Natural Language Processing. ☆61 · Updated 2 years ago
- Source code accompanying the KONVENS 2019 paper "Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized Em… ☆65 · Updated 5 years ago
- Fine-tune transformers with pytorch-lightning ☆44 · Updated 3 years ago
- Codebase for probing and visualizing multilingual models. ☆49 · Updated 5 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys… ☆11 · Updated 3 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction ☆119 · Updated 3 years ago
- A collection of task-specific NLU datasets ☆149 · Updated 3 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations ☆56 · Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models ☆65 · Updated 2 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0. ☆103 · Updated 3 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal… ☆81 · Updated 3 years ago
- LongSumm - Scientific Document Summarization Task ☆74 · Updated 3 years ago
- Build a dialog dataset from online books in many languages ☆75 · Updated 2 years ago