DataScienceUIBK / ArabicaQA
ArabicaQA: Comprehensive Dataset for Arabic Question Answering, accepted at SIGIR 2024
☆17Updated last year
Alternatives and similar repositories for ArabicaQA
Users who are interested in ArabicaQA are comparing it to the repositories listed below.
- ☆126Updated last year
- Generalist and Lightweight Model for Text Classification☆164Updated 4 months ago
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Updated 11 months ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆43Updated 7 months ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆76Updated 6 months ago
- ☆124Updated 8 months ago
- Fine-tune LLMs in a few lines of code (Text2Text, Text2Speech, Speech2Text)☆243Updated last year
- Data extraction with LLM on CPU☆112Updated last year
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆20Updated last year
- ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆114Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆105Updated last month
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆115Updated 3 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆165Updated 5 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated last year
- Testing and evaluation framework for voice agents☆154Updated 5 months ago
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆150Updated 9 months ago
- 📚 Datasets and models for instruction-tuning☆237Updated 2 years ago
- Workflows are an event-driven, async-first, step-based way to control the execution flow of AI applications like agents.☆239Updated this week
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.☆26Updated 2 years ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆67Updated last year
- ☆115Updated 2 weeks ago
- Simple UI for debugging correlations of text embeddings☆298Updated 5 months ago
- Data extraction with LLM on CPU☆68Updated last year
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳☆338Updated 5 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆122Updated last year
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆190Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year