[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding
☆67May 24, 2025Updated 11 months ago
Alternatives and similar repositories for KITAB-Bench
Users that are interested in KITAB-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology☆12Jun 17, 2025Updated 11 months ago
- [NAACL'25] Contains code and documentation for our VANE-Bench paper.☆24Aug 19, 2025Updated 9 months ago
- ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark☆17May 25, 2025Updated 11 months ago
- Python intefrace for evaluation on chatgpt models☆19Feb 13, 2024Updated 2 years ago
- A new multi-task learning framework using Vision Transformers☆11Jun 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabi…☆79Sep 24, 2024Updated last year
- Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023.☆18Aug 29, 2024Updated last year
- The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.☆196Jan 30, 2026Updated 3 months ago
- Official repository for "Boosting Adversarial Transferability using Dynamic Cues " (ICLR 2023)☆20Aug 24, 2023Updated 2 years ago
- This repository contains the official source code for SALT: Parameter-Efficient Fine-Tuning via Singular Value Adaptation with Low-Rank T…☆30Nov 29, 2025Updated 5 months ago
- [BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models☆15Nov 1, 2024Updated last year
- This repository contains the code for Optimizing Brain Tumor Segmentation with MedNeXt: BraTS 2024 SSA and Pediatrics (MICCAI'24)☆27Mar 22, 2025Updated last year
- Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]☆22Oct 27, 2024Updated last year
- AIN - The First Arabic Inclusive Large Multimodal Model. It is a versatile bilingual LMM excelling in visual and contextual understanding…☆54Mar 13, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Large Multimodal Model for Remote Sensing Change Description (IGARSS 2025)☆22Dec 17, 2025Updated 5 months ago
- Open Source tool for Arabic text readability☆24Jul 4, 2025Updated 10 months ago
- Composed Video Retrieval☆62May 2, 2024Updated 2 years ago
- Interview questions asked in Data Science/ Machine Learning interviews☆19Jan 15, 2020Updated 6 years ago
- هذا الدليل لمساعدة المهتمين في تعلم معالجة النصوص في اللغة العربية☆50Apr 9, 2025Updated last year
- [CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"☆26Jun 8, 2025Updated 11 months ago
- [ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models☆29Oct 20, 2025Updated 7 months ago
- Abstract. Person search is a challenging problem with various real- world applications, that aims at joint person detection and re-identi…☆13Feb 28, 2024Updated 2 years ago
- ☆12Mar 15, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"☆85May 18, 2024Updated 2 years ago
- [BMVC 2025] Official Implementation of the paper "PerSense: Personalized Instance Segmentation in Dense Images"☆31Dec 18, 2025Updated 5 months ago
- CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming☆11Nov 18, 2024Updated last year
- Official repository of paper titled "D3Former: Debiased Dual Distilled Transformer for Incremental Learning".☆25Jul 10, 2023Updated 2 years ago
- [CVPR 2023] Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection☆31Jun 21, 2023Updated 2 years ago
- Reinforcement Training of Robot☆11Dec 1, 2019Updated 6 years ago
- [⭐ CVPR 2025 Highlight ⭐] Official Implementation of the paper STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing fro…☆31Apr 22, 2025Updated last year
- Code for Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking☆33Mar 14, 2025Updated last year
- 【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval☆93Apr 16, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆10May 12, 2021Updated 5 years ago
- [CVPR 2026 🔥] Time Blindness: Why Video-Language Models Can't See What Humans Can?☆62Jan 28, 2026Updated 3 months ago
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"☆84Jan 2, 2026Updated 4 months ago
- ☆11May 1, 2021Updated 5 years ago
- Datasette plugin providing a UI for executing SQL writes against the database☆12Nov 11, 2025Updated 6 months ago
- A zero-config OpenAI client with support for 20+ providers, API key rotation, rate limits, optional LangChain integration and more.☆19Dec 11, 2025Updated 5 months ago
- Official repository of FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis☆62Feb 5, 2026Updated 3 months ago