1337-Artificial-Intelligence / hackai_app
☆19Updated 2 months ago
Alternatives and similar repositories for hackai_app
Users who are interested in hackai_app are comparing it to the libraries listed below
- Awesome Darija Arabic NLP Resources☆18Updated 5 months ago
- A curated collection of resources and repositories for Natural Language Processing (NLP) tasks specific to Darija, the Moroccan Arabic di…☆91Updated 2 years ago
- 4-day AI hackathon in 1337 Benguerir, Morocco☆49Updated 3 months ago
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Updated 9 months ago
- A list of Moroccan Darija Datasets grouped by name, data source, region and size.☆55Updated last year
- This repo is for semantic search app to search over Quran tafsir books☆24Updated last year
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆112Updated 11 months ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆57Updated 2 years ago
- ☆125Updated last year
- TODa: Tamazight Open Dataset☆16Updated 8 months ago
- Egyptian ID Card Recognition System 💳 A Python-based application to detect and process Egyptian ID cards using YOLO and EasyOCR.☆27Updated 7 months ago
- Python interface for evaluation on ChatGPT models☆19Updated last year
- Arabic cleaning, normalization and segmentation library.☆71Updated last year
- Dvoice is a speech recognition tool for dialects and under-represented languages.☆33Updated 3 years ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆20Updated last year
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Updated last year
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆41Updated 5 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year
- The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.☆178Updated 3 months ago
- ☆44Updated 4 months ago
- This guide is meant to help those interested in learning text processing for the Arabic language☆49Updated 5 months ago
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆37Updated 2 months ago
- (أسبر) A repository for survey and review papers in Arabic Natural Language Processing (AN…☆81Updated last month
- An effort to benchmark Arabic legal reasoning in foundation models.☆14Updated 4 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆129Updated last week
- A CLI for generating synthetic data☆42Updated 4 months ago
- Arabic Tokenization Library. It provides many tokenization algorithms.☆107Updated last year
- A comprehensive list of Arabic NLP resources.☆35Updated 2 weeks ago
- ☆58Updated last year
- Let's build better datasets, together!☆263Updated 9 months ago