mbzuai-oryx / KITAB-BenchLinks
[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding
☆60Updated 6 months ago
Alternatives and similar repositories for KITAB-Bench
Users that are interested in KITAB-Bench are comparing it to the libraries listed below
Sorting:
- AIN - The First Arabic Inclusive Large Multimodal Model. It is a versatile bilingual LMM excelling in visual and contextual understanding…☆49Updated 9 months ago
- Python intefrace for evaluation on chatgpt models☆19Updated last year
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Updated last year
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆43Updated 8 months ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆21Updated last year
- ☆128Updated last year
- Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023.☆17Updated last year
- vision language models finetuning notebooks & use cases (Medgemma - paligemma - florence .....)☆58Updated 2 months ago
- ☆56Updated last year
- Large Language Models: In this repository Language models are introduced covering both theoretical and practical aspects.☆392Updated 2 months ago
- Code for Arabic Nougat☆50Updated last year
- [NAACL 2025 🔥] CAMEL-Bench is an Arabic benchmark for evaluating multimodal models across eight domains with 29,000 questions.☆34Updated 8 months ago
- هذا الدليل لمساعدة المهتمين في تعلم معالجة النصوص في اللغة العربية☆49Updated 8 months ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated 4 months ago
- ☆36Updated 10 months ago
- [CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the…☆45Updated 6 months ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆57Updated 2 years ago
- Fine tune Gemma 3 on an object detection task☆92Updated 5 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆97Updated last year
- Arabic nested named entity recognition☆42Updated 9 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆77Updated 5 months ago
- ☆42Updated 4 months ago
- A open-source framework designed to adapt pre-trained Language Models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of dom…☆23Updated last year
- Seq2Seq-based open domain empathetic conversational model for Arabic: Dataset & Model☆59Updated 9 months ago
- ☆30Updated 5 months ago
- Bio-Medical EXpert LMM with English and Arabic Language Capabilities☆71Updated last month
- Composition of Multimodal Language Models From Scratch☆15Updated last year
- Notebooks for fine tuning pali gemma☆117Updated 8 months ago
- ☆78Updated 3 weeks ago
- أسئلة باللغة العربية تركز على الثقافة السعودية تم اختبارها على عدد من النماذج اللغوية الضخمة LLMs☆17Updated 10 months ago