DataScienceUIBK / ArabicaQA
ArabicaQA: Comprehensive Dataset for Arabic Question Answering accepted at SIGIR 2024
☆13Updated 6 months ago
Alternatives and similar repositories for ArabicaQA:
Users that are interested in ArabicaQA are comparing it to the libraries listed below
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆24Updated 2 months ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Updated 7 months ago
- ☆120Updated 11 months ago
- أسئلة باللغة العربية تركز على الثقافة السعودية تم اختبارها على عدد من النماذج اللغوية الضخمة LLMs☆13Updated 3 weeks ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 4 months ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆16Updated 6 months ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆34Updated 11 months ago
- Efficient few-shot learning with cross-encoders.☆48Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆21Updated last month
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆67Updated last year
- ☆47Updated last year
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models☆67Updated 2 weeks ago
- Python intefrace for evaluation on chatgpt models☆19Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆59Updated 6 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆100Updated 10 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆51Updated 3 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆101Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding☆119Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)☆97Updated 11 months ago
- ☆22Updated 11 months ago
- Completion After Prompt Probability. Make your LLM make a choice☆74Updated 3 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆171Updated 5 months ago
- Web UI & Backend for Data Annotations in Aya☆26Updated 11 months ago
- MAFAND-MT☆55Updated 7 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆82Updated last week
- Data extraction with LLM on CPU☆67Updated last year
- Arabic nested named entity recognition☆33Updated 9 months ago
- Efficient vector database for hundred millions of embeddings.☆206Updated 9 months ago
- ☆141Updated 7 months ago