lanwuwei / GigaBERT
Zero-shot Transfer Learning from English to Arabic
☆29Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for GigaBERT
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆23Updated 3 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆46Updated last year
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆11Updated 2 years ago
- Fine-tune transformers with pytorch-lightning☆44Updated 2 years ago
- Arabic edition of ALBERT pretrained language models☆16Updated 3 years ago
- Lexical Simplification with Pretrained Encoders☆69Updated 3 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆54Updated 2 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- ☆12Updated 4 years ago
- ☆29Updated 2 years ago
- GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019)☆92Updated 2 years ago
- A program to choose transfer languages for cross-lingual learning☆70Updated last year
- Multilingual abstractive summarization dataset extracted from WikiHow.☆81Updated 3 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated last year
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆39Updated 5 years ago
- Massively Multilingual Transfer for NER☆85Updated 3 years ago
- This repository contains materials for our tutorial on automatic grammatical error correction: R. Grundkiewicz, C. Bryant, M. Felice: A C…☆38Updated 3 years ago
- Transformer based translation quality estimation☆107Updated last year
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆35Updated last year
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆97Updated last year
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆63Updated 4 years ago
- ☆12Updated 3 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆91Updated last year
- ☆91Updated 8 months ago
- Codebase for probing and visualizing multilingual models.☆45Updated 4 years ago
- OpusFilter - Parallel corpus processing toolkit☆102Updated 2 months ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆72Updated last year
- XED multilingual emotion datasets☆56Updated last year