Code and models for "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". EACL 2021, WANLP.
☆56Jun 21, 2024Updated last year
Alternatives and similar repositories for CAMeLBERT
Users that are interested in CAMeLBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆12Jan 26, 2022Updated 4 years ago
- This is a repository of the Multi-dialect Arabic BERT model.☆38Jul 14, 2020Updated 5 years ago
- A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.☆542Mar 5, 2026Updated last month
- BERT for Arabic Topic Modeling: An Experimental Study on BERTopic Technique☆28Apr 23, 2021Updated 4 years ago
- A Python implementation of Farasa toolkit☆140Sep 11, 2025Updated 6 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Reading comprehension on the Holy Qur'an☆10Oct 15, 2025Updated 5 months ago
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆28Feb 18, 2021Updated 5 years ago
- NLP Webinars Created for Udacity's Mentorship Program (2019).☆11Nov 11, 2022Updated 3 years ago
- Arabic edition of BERT pretrained language models☆134Dec 5, 2020Updated 5 years ago
- Pre-trained Transformers for Arabic Language Understanding and Generation (Arabic BERT, Arabic GPT2, Arabic ELECTRA)☆717Oct 17, 2022Updated 3 years ago
- Arabic To English translation using transformer neural nets.☆15Mar 15, 2019Updated 7 years ago
- Finetuning of Arabert, Dziribert and Bert arabic for dialect detection.☆16Oct 23, 2021Updated 4 years ago
- Arabic Tokenization Library. It provides many tokenization algorithms.☆111Jan 4, 2024Updated 2 years ago
- Python package for Arabic natural language processing☆28Jun 12, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.☆12Feb 5, 2020Updated 6 years ago
- Arabic NLP tools List inventory☆91Dec 17, 2022Updated 3 years ago
- Automatic Dialect Detection Repository☆39Nov 13, 2022Updated 3 years ago
- UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic☆116Sep 2, 2021Updated 4 years ago
- Implementation of many Arabic NLP and CV projects. Providing real time experience using many interfaces like web, command line and notebo…☆423Mar 1, 2024Updated 2 years ago
- ArWordVec is a collection of pre-trained word embedding model built from huge repository of Arabic tweets in different topics. The aim of…☆19Jul 9, 2020Updated 5 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- End-to-End Arabic ASR using DeepSpeech engine☆14Nov 2, 2021Updated 4 years ago
- A Julia package for working with the Quranic Arabic Corpus.☆17Nov 6, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆46Apr 3, 2025Updated last year
- Neural Arabic text diacritization☆95Mar 24, 2023Updated 3 years ago
- ☆14Mar 7, 2019Updated 7 years ago
- Pre-process arabic text (remove diacritics, punctuations and repeating characters)☆108Apr 8, 2017Updated 9 years ago
- The first AI-based Arabic songwriter.☆34Apr 7, 2017Updated 9 years ago
- ☆16Jan 13, 2021Updated 5 years ago
- Large Arabic Resources For Sentiment Analysis☆121Apr 16, 2018Updated 7 years ago
- Multi-turn open-domain Arabic chatbot with a wide set of features.☆39Aug 29, 2023Updated 2 years ago
- Arabic Parser Using Stanford API☆12Nov 11, 2017Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP researc…☆419Apr 4, 2021Updated 5 years ago
- Arabic Words☆12Nov 18, 2019Updated 6 years ago
- repository for the project of building large arabic multidomain lexicon for sentiment analysis using feature selection from multiple reso…☆16Jan 21, 2015Updated 11 years ago
- Pre-production releases for Spacy in Catalan☆14Nov 30, 2021Updated 4 years ago
- Improving Sentiment Analysis with Multi-task Learning of Negation☆14May 6, 2021Updated 4 years ago
- A curated collection of resources and repositories for Natural Language Processing (NLP) tasks specific to Darija, the Moroccan Arabic di…☆102Sep 27, 2023Updated 2 years ago
- Maha is a text processing library specially developed to deal with Arabic text.☆214Mar 16, 2026Updated 3 weeks ago