iabufarha / ArSarcasm-v2View external linksLinks
ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analysis, which is a part of WANLP 2021.
☆12Jan 26, 2022Updated 4 years ago
Alternatives and similar repositories for ArSarcasm-v2
Users that are interested in ArSarcasm-v2 are comparing it to the libraries listed below
Sorting:
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆26Feb 18, 2021Updated 4 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Code and models for "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". EACL 2021, WANLP.☆55Jun 21, 2024Updated last year
- ☆12Jun 6, 2020Updated 5 years ago
- This code belongs to ACL conference paper entitled as "An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering"☆17Apr 22, 2021Updated 4 years ago
- Arabic Art using GANs☆17Aug 3, 2022Updated 3 years ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆21Aug 4, 2024Updated last year
- Arabic Language Model based on Bert☆19Mar 22, 2020Updated 5 years ago
- Religious Hate Speech Detection for Arabic Tweets☆26Feb 1, 2019Updated 7 years ago
- Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit☆28Aug 19, 2022Updated 3 years ago
- Zero-shot Transfer Learning from English to Arabic☆30Jun 22, 2022Updated 3 years ago
- Creating knowledge graphs by scraping wiki pages and storing data in the Neo4j Graph DB.☆29May 17, 2021Updated 4 years ago
- Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP expe…☆33Dec 8, 2022Updated 3 years ago
- A Python implementation of Farasa toolkit☆139Sep 11, 2025Updated 5 months ago
- ☆10Feb 2, 2026Updated 2 weeks ago
- ☆10Jan 8, 2022Updated 4 years ago
- Arabic edition of BERT pretrained language models☆133Dec 5, 2020Updated 5 years ago
- ☆34Dec 14, 2023Updated 2 years ago
- ☆36Jul 16, 2021Updated 4 years ago
- source files for GloBI website☆10Feb 8, 2026Updated last week
- open source knowledge for Syllabics font design and development☆10Nov 13, 2024Updated last year
- A little bit of help.☆13Jul 25, 2024Updated last year
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs☆11Jan 13, 2023Updated 3 years ago
- Language Model Fine-tuning for Moby Dick☆42Mar 3, 2019Updated 6 years ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆43Apr 3, 2025Updated 10 months ago
- News clustering algorithm. Implementation of the "Multilingual Clustering of Streaming News" paper submitted to EMNLP 2018☆38May 2, 2022Updated 3 years ago
- The Tweets2013 Internet Archive collection☆10Aug 7, 2020Updated 5 years ago
- ☆12Oct 1, 2025Updated 4 months ago
- Spell check for Arabic text using python☆14Mar 22, 2019Updated 6 years ago
- Applied Data Science training course (for updates and resources, read the ReadMe file below)☆15Sep 9, 2023Updated 2 years ago
- Repository for our paper "AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts"☆11Jul 18, 2021Updated 4 years ago
- Reading comprehension on the Holy Qur'an☆10Oct 15, 2025Updated 4 months ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- Kaggle | 22nd place solution for RANZCR CLiP - Catheter and Line Position Challenge.☆11Apr 19, 2021Updated 4 years ago
- Author Profiling for Abuse Detection (COLING 2018)☆10Dec 8, 2022Updated 3 years ago
- Latent Dirichlet Allocation on tweets☆15May 17, 2015Updated 10 years ago
- Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification☆10May 31, 2022Updated 3 years ago
- Using Pytorch to solve the famous titanic dataset problem, or in other words, killing a fly with a tank.☆11Apr 8, 2019Updated 6 years ago
- ☆11Jul 12, 2021Updated 4 years ago