Explore the content of Arabic text datasets.
☆18May 23, 2022Updated 3 years ago
Alternatives and similar repositories for bayanat
Users that are interested in bayanat are comparing it to the libraries listed below
Sorting:
- Arabic cleaning, normalization and segmentation library.☆74Sep 28, 2023Updated 2 years ago
- Arabic Tokenization Library. It provides many tokenization algorithms.☆110Jan 4, 2024Updated 2 years ago
- Arabic Art using GANs☆17Aug 3, 2022Updated 3 years ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.☆12Feb 5, 2020Updated 6 years ago
- This model detects arabic fonts (نسخ, رقعة) given a picture of the text, Live https://calbot.hawzen.me/☆16May 27, 2023Updated 2 years ago
- Arabic poetry analysis and generation.☆23Jul 23, 2023Updated 2 years ago
- Several deep learning models for restoring Arabic diacritics using Pytorch.☆36Apr 14, 2022Updated 3 years ago
- TEAD : Large Scale Arabic Dataset for Sentiment Analysis☆12Oct 16, 2018Updated 7 years ago
- أفكار لمشروع رقمنة الكتب عن طريق الجهد الموزع☆20Dec 18, 2021Updated 4 years ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆46Apr 3, 2025Updated 11 months ago
- Sentiment analysis on Algerian and modern standard Arabic using deep learning and SVM☆14Apr 18, 2020Updated 5 years ago
- The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.☆194Jan 30, 2026Updated last month
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Dec 9, 2024Updated last year
- A Python implementation of Farasa toolkit☆139Sep 11, 2025Updated 6 months ago
- Python package for Arabic natural language processing☆28Jun 12, 2019Updated 6 years ago
- ☆10Oct 8, 2018Updated 7 years ago
- Shami Dialect Corpus (SDC)☆29Feb 13, 2018Updated 8 years ago
- This repo will hold all the material related to the talks and workshops that took place during IndabaXEgypt'19☆18Jul 8, 2019Updated 6 years ago
- A jQuery plugin to display a calendar heatmap like Github's contributions timeline.☆11Aug 31, 2019Updated 6 years ago
- A central hub for translating stuff into Arabic (Join our Discord Server, if you want to help)☆49Jan 12, 2026Updated 2 months ago
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆26Feb 18, 2021Updated 5 years ago
- Ivanti Pulse Secure CVE-2023-46805 Scanner - Based on Assetnote's Research☆12Jan 19, 2024Updated 2 years ago
- ☆20Feb 25, 2021Updated 5 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Apr 30, 2021Updated 4 years ago
- ☆40Dec 25, 2022Updated 3 years ago
- An automated twitter robot which built with Python + Selenium to crawl last online stores deals and tweet about them.☆34Jun 30, 2019Updated 6 years ago
- A Docker Wrapper to make the machine easily learn any language on top of INRIA OSCAR dataset using GPT2☆12Jan 30, 2020Updated 6 years ago
- ☆12May 21, 2020Updated 5 years ago
- Burp Suite Extensions☆12Oct 19, 2021Updated 4 years ago
- Arabic Stop Word List☆36Jan 11, 2024Updated 2 years ago
- ☆10Nov 6, 2024Updated last year
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 8 months ago
- ☆30Feb 11, 2022Updated 4 years ago
- Arabic Open Domain Question Answering System using Neural Reading Comprehension☆165Aug 4, 2023Updated 2 years ago
- Tunisian Arabish Corpus☆12Mar 12, 2024Updated 2 years ago
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- Arabic typing practice☆14Jun 27, 2019Updated 6 years ago
- Curated list of Moroccans publishing in the most prestigious AI conferences☆10Oct 14, 2024Updated last year
- OSC'21 Sessions Documentations.☆13Sep 1, 2021Updated 4 years ago