defog-ai / defog-data
This repository contains the metadata and data of different databases that we use for testing
☆14Updated last year
Alternatives and similar repositories for defog-data
Users who are interested in defog-data are comparing it to the repositories listed below
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆68Updated last year
- ☆82Updated 3 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆168Updated 2 years ago
- ☆144Updated 3 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆67Updated last year
- ☆36Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated last year
- ☆61Updated last year
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …☆45Updated 2 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated last year
- Introduction page of a challenging text-to-SQL dataset: KaggleDBQA☆42Updated 2 years ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆242Updated last year
- Official Repo for CRMArena and CRMArena-Pro☆132Updated last week
- Evaluation of bm42 sparse indexing algorithm☆72Updated last year
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆151Updated last year
- Code and data for the paper "DBCᴏᴘɪʟᴏᴛ: Natural Language Querying over Massive Database via Schema Routing" (EDBT 2025)☆134Updated 5 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆259Updated 3 weeks ago
- Evaluating tool-augmented LLMs in conversation settings☆88Updated last year
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆165Updated 3 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆82Updated last year
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]☆196Updated 5 months ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker☆126Updated 3 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- Code for KaLM-Embedding models☆113Updated 7 months ago
- Using Large Language Models (LLMs) to convert natural language queries to SQL☆54Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆239Updated 4 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆74Updated last year
- Code, data, and model of paper "Text-to-SQL Error Correction with Language Models of Code" (ACL'23)☆31Updated last year
- UNITE: A Unified Benchmark for Text-to-SQL Evaluation☆84Updated 8 months ago
- Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering"☆81Updated 9 months ago