UBC-NLP / aoc_idLinks
Arabic Dialect Identification on AOC data.
☆24Updated 6 years ago
Alternatives and similar repositories for aoc_id
Users that are interested in aoc_id are comparing it to the libraries listed below
Sorting:
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆24Updated 4 years ago
- Zero-shot Transfer Learning from English to Arabic☆30Updated 3 years ago
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation.☆10Updated 2 years ago
- Code and models for "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". EACL 2021, WANLP.☆48Updated last year
- ☆14Updated 4 years ago
- Arabic edition of BERT pretrained language models☆130Updated 4 years ago
- This is a repository of the Multi-dialect Arabic BERT model.☆38Updated 5 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆11Updated 3 years ago
- UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic☆109Updated 3 years ago
- ☆54Updated 3 years ago
- ☆43Updated 9 years ago
- hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic☆47Updated 4 years ago
- ☆109Updated last year
- A Python implementation of Farasa toolkit☆132Updated last month
- This repository provides our datasets for Arabic emotion detection in Twitter☆9Updated 7 years ago
- A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be…☆26Updated 11 years ago
- HateEval 2019 - Task 5☆16Updated 6 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated 2 years ago
- Code for extracting parallel corpora from pmindia☆16Updated 5 years ago
- Arabic edition of ALBERT pretrained language models☆16Updated 4 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆92Updated 4 months ago
- Curated list of publicly available parallel corpus for Indian Languages☆33Updated 4 years ago
- Transformer based translation quality estimation☆112Updated last year
- Pre-process arabic text (remove diacritics, punctuations and repeating characters)☆107Updated 8 years ago
- ☆35Updated 3 years ago
- These are lists for a variety of languages containing words that are distinctive to each language.☆38Updated 3 years ago
- ☆17Updated 5 years ago
- Arabic Language Model based on Bert☆19Updated 5 years ago
- Neural CRF Model for Sentence Alignment in Text Simplification☆68Updated 5 months ago