lanwuwei / GigaBERT
Zero-shot Transfer Learning from English to Arabic
☆29Updated 2 years ago
Alternatives and similar repositories for GigaBERT:
Users that are interested in GigaBERT are comparing it to the libraries listed below
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆24Updated 4 years ago
- Arabic edition of ALBERT pretrained language models☆16Updated 3 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Updated 2 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆11Updated 3 years ago
- ☆12Updated 4 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆55Updated 2 years ago
- GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019)☆94Updated 2 years ago
- Lexical Simplification with Pretrained Encoders☆70Updated 4 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆74Updated last year
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆94Updated 2 years ago
- OpusFilter - Parallel corpus processing toolkit☆104Updated 2 weeks ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆63Updated 4 years ago
- Codebase for probing and visualizing multilingual models.☆47Updated 4 years ago
- Arabic NER system with a strong performance☆35Updated 5 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- Code for the EMNLP 2020 paper titled "Chapter Captor: Text Segmentation in Novels"☆30Updated 4 years ago
- Use BERT to Fill in the Blanks☆82Updated 3 years ago
- Disambiguate is a tool for training and using state of the art neural WSD models☆59Updated 2 years ago
- HateEval 2019 - Task 5☆17Updated 6 years ago
- ☆75Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆89Updated 3 weeks ago
- Massively Multilingual Transfer for NER☆86Updated 3 years ago
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"☆56Updated 4 months ago
- ☆68Updated 3 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 2 years ago
- a Fairseq fork for sequence tagging/labeling tasks☆31Updated 4 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆120Updated 4 years ago