UM6P-EMINES / Awesome-Darija-Arabic-NLP-ResourcesLinks
Awesome Darija Arabic NLP Resources
☆18Updated 5 months ago
Alternatives and similar repositories for Awesome-Darija-Arabic-NLP-Resources
Users that are interested in Awesome-Darija-Arabic-NLP-Resources are comparing it to the libraries listed below
Sorting:
- ☆19Updated 3 months ago
- A curated collection of resources and repositories for Natural Language Processing (NLP) tasks specific to Darija, the Moroccan Arabic di…☆92Updated 2 years ago
- TODa: Tamazight Open Dataset☆16Updated 9 months ago
- This repo is for semantic search app to search over Quran tafsir books☆24Updated last year
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Updated 10 months ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆175Updated last year
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆12Updated 3 years ago
- Chat with Towards Data Science☆14Updated last year
- A list of Moroccan Darija Datasets grouped by name, data source, region and size.☆55Updated last year
- Python intefrace for evaluation on chatgpt models☆19Updated last year
- ☆125Updated last year
- A system that allows to edit images with hand gestures captured by a webcam☆22Updated 2 years ago
- A Java toolkit to generate multi fonts Arabic text images☆11Updated 4 years ago
- ☆124Updated 11 months ago
- 4-day AI hackathon in 1337 Benguerir, Morocco☆50Updated 4 months ago
- Fine tune Gemma 3 on an object detection task☆86Updated 3 months ago
- ☆34Updated 8 months ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆57Updated 2 years ago
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆37Updated 2 months ago
- ☆127Updated 6 months ago
- a python package for loadimg and converting images☆28Updated 2 months ago
- ☆14Updated last year
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆20Updated last year
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 4 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year
- building a Large Language Model (LLM) from scratch.☆34Updated 8 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated last month
- ☆207Updated 4 months ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆41Updated 6 months ago
- A template to kick-start your Python project ✨🚀☆52Updated 2 months ago