A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple design that makes use of the different tools in our NLP pipeline.
☆21Jan 27, 2024Updated 2 years ago
Alternatives and similar repositories for nmatheg
Users that are interested in nmatheg are comparing it to the libraries listed below.
- Arabic Art using GANs☆17Aug 3, 2022Updated 3 years ago
- Explore the content of Arabic text datasets.☆18May 23, 2022Updated 4 years ago
- Arabic cleaning, normalization and segmentation library.☆76Sep 28, 2023Updated 2 years ago
- Arabic Tokenization Library. It provides many tokenization algorithms.☆111Jan 4, 2024Updated 2 years ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 10 months ago
- Platform for Arabic Poetry Analysis using knowledge-based and deep learning approaches.☆38Jan 3, 2023Updated 3 years ago
- Al-Faraheedy Project☆23Jun 20, 2024Updated last year
- Python interface for evaluation on ChatGPT models☆19Feb 13, 2024Updated 2 years ago
- hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic☆47Nov 16, 2020Updated 5 years ago
- ☆41Dec 25, 2022Updated 3 years ago
- Arabic light stemmer. Light stemming for Arabic words removes prefixes and suffixes and normalizes words☆19Dec 16, 2021Updated 4 years ago
- ☆14Apr 12, 2019Updated 7 years ago
- ArWordVec is a collection of pre-trained word embedding models built from a huge repository of Arabic tweets in different topics. The aim of…☆19Jul 9, 2020Updated 5 years ago
- Arabic edition of ALBERT pretrained language models☆16Apr 25, 2021Updated 5 years ago
- Hijri date library on top of Carbon☆23Mar 9, 2025Updated last year
- Arabic Support for Festival speech synthesis system☆11Sep 30, 2019Updated 6 years ago
- Benchmark Arabic text diacritization dataset☆79Apr 7, 2026Updated 2 months ago
- Personalized Response Generation via Generative Split Memory Network☆12Sep 6, 2021Updated 4 years ago
- Code for the paper - Controlling Dialogue Generation with Semantic Exemplars (NAACL 2021) A semantic exemplar based retrieve-refine appro…☆18Mar 26, 2021Updated 5 years ago
- Collection of various Arabic NLP and Text Processing Scripts and Utilities☆60Oct 10, 2013Updated 12 years ago
- A dataset for online Arabic calligraphy. A collection of 2500 annotated calligraphic styles.☆156Jun 24, 2024Updated last year
- Arabic speech recognition, classification and text-to-speech.☆429Sep 30, 2023Updated 2 years ago
- An elegant wrapper around Google Vision API☆27Nov 8, 2021Updated 4 years ago
- ☆24Sep 26, 2025Updated 8 months ago
- A decentralized alternative to proprietary and centralized cloud storage.☆15Mar 8, 2023Updated 3 years ago
- AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP researc…☆423Apr 4, 2021Updated 5 years ago
- Fake News Detection: Model implementations and Hyper-Parameters☆21Jul 3, 2020Updated 5 years ago
- Cookiecutter PyTorch Lightning☆12Sep 7, 2021Updated 4 years ago
- Arabic Open Domain Question Answering System using Neural Reading Comprehension☆167Aug 4, 2023Updated 2 years ago
- A basic example of building a chat bot by applying it on the domain of Islamic Fatwa.☆16Jan 21, 2018Updated 8 years ago
- Pre-training BART in Flax on The Pile dataset☆22Jul 24, 2021Updated 4 years ago
- A case study of efficient training of large language models using commodity hardware.☆67Aug 4, 2022Updated 3 years ago
- This model detects Arabic fonts (نسخ, رقعة) given a picture of the text. Live: https://calbot.hawzen.me/☆18May 27, 2023Updated 3 years ago
- Shami Dialect Corpus (SDC)☆29Feb 13, 2018Updated 8 years ago
- ☆10Nov 6, 2024Updated last year
- Using StyleGAN2 to generate mosaics.☆51Dec 29, 2020Updated 5 years ago
- Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits☆45Jan 8, 2026Updated 5 months ago
- A simple x86 EFI bootloader for Android™ boot images☆29Feb 16, 2019Updated 7 years ago