DataScienceUIBK / ArabicaQALinks
ArabicaQA: Comprehensive Dataset for Arabic Question Answering accepted at SIGIR 2024
☆18Updated last year
Alternatives and similar repositories for ArabicaQA
Users that are interested in ArabicaQA are comparing it to the libraries listed below
Sorting:
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Updated last year
- Generalist and Lightweight Model for Text Classification☆169Updated last week
- ☆127Updated last year
- A library for working with prompt templates locally or on the Hugging Face Hub.☆52Updated 10 months ago
- Simple UI for debugging correlations of text embeddings☆305Updated 8 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆122Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆184Updated last year
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- ☆125Updated 11 months ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆43Updated 9 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆352Updated 8 months ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆79Updated 9 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆83Updated last year
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆21Updated last year
- Agentic RAG to help you build a startup🚀☆55Updated 9 months ago
- Experimental tl;dr summaries for datasets on the Hugging Face Hub!☆10Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆122Updated 2 years ago
- Data extraction with LLM on CPU☆112Updated 2 years ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆114Updated 9 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆106Updated last year
- Data extraction with LLM on CPU☆68Updated 2 years ago
- A reimplementation of langgraph's customer support example in Rasa's CALM paradigm and a quantiative evaluation of the 2 approaches☆81Updated 10 months ago
- ☆48Updated 2 years ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆107Updated 4 months ago
- 💙 Unstructured Data Connectors for Haystack 2.0☆17Updated 2 years ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- ☆21Updated last year
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…☆147Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated last year