Explore the content of Arabic text datasets.
☆18May 23, 2022Updated 3 years ago
Alternatives and similar repositories for bayanat
Users that are interested in bayanat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Arabic cleaning, normalization and segmentation library.☆76Sep 28, 2023Updated 2 years ago
- Python intefrace for evaluation on chatgpt models☆19Feb 13, 2024Updated 2 years ago
- Arabic Tokenization Library. It provides many tokenization algorithms.☆111Jan 4, 2024Updated 2 years ago
- Arabic Art using GANs☆17Aug 3, 2022Updated 3 years ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.☆12Feb 5, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This model detects arabic fonts (نسخ, رقعة) given a picture of the text, Live https://calbot.hawzen.me/☆18May 27, 2023Updated 2 years ago
- Platform for Arabic Poetry Analysis using knowledge-based and deep learning approaches.☆37Jan 3, 2023Updated 3 years ago
- Several deep learning models for restoring Arabic diacritics using Pytorch.☆36Apr 14, 2022Updated 4 years ago
- TEAD : Large Scale Arabic Dataset for Sentiment Analysis☆12Oct 16, 2018Updated 7 years ago
- أفكار لمشروع رقمنة الكتب عن طريق الجهد الموزع☆20Dec 18, 2021Updated 4 years ago
- Everyday Arabic-English Scene Text dataset☆16Oct 14, 2021Updated 4 years ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆46Apr 3, 2025Updated last year
- The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.☆196Jan 30, 2026Updated 3 months ago
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Dec 9, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Python implementation of Farasa toolkit☆140Sep 11, 2025Updated 7 months ago
- Python package for Arabic natural language processing☆28Jun 12, 2019Updated 6 years ago
- Maha is a text processing library specially developed to deal with Arabic text.☆216Updated this week
- ☆10Oct 8, 2018Updated 7 years ago
- This repo will hold all the material related to the talks and workshops that took place during IndabaXEgypt'19☆18Jul 8, 2019Updated 6 years ago
- Cookiecutter PyTorch Lightning☆12Sep 7, 2021Updated 4 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Apr 30, 2021Updated 5 years ago
- Arabic nested named entity recognition☆46Mar 10, 2025Updated last year
- ☆40Dec 25, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- An automated twitter robot which built with Python + Selenium to crawl last online stores deals and tweet about them.☆35Jun 30, 2019Updated 6 years ago
- Burp Suite extension designed to help security professionals search for custom sensitive information in HTTP responses☆11Apr 25, 2023Updated 3 years ago
- A Docker Wrapper to make the machine easily learn any language on top of INRIA OSCAR dataset using GPT2☆12Jan 30, 2020Updated 6 years ago
- ☆12May 21, 2020Updated 5 years ago
- ☆10Jul 21, 2023Updated 2 years ago
- ☆10Nov 6, 2024Updated last year
- Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits☆40Jan 8, 2026Updated 3 months ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 9 months ago
- Project work from the Udacity machine learning nanodegree encompassing general techniques for supervised,unsupervised and reinforcement l…☆11Mar 15, 2017Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆30Feb 11, 2022Updated 4 years ago
- Arabic Open Domain Question Answering System using Neural Reading Comprehension☆166Aug 4, 2023Updated 2 years ago
- Arabic edition of BERT pretrained language models☆134Dec 5, 2020Updated 5 years ago
- The official implementation of CATT Arabic diacritization models.☆69Jul 18, 2025Updated 9 months ago
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- Curated list of Moroccans publishing in the most prestigious AI conferences☆11Oct 14, 2024Updated last year
- OSC'21 Sessions Documentations.☆13Sep 1, 2021Updated 4 years ago