defog-ai / defog-data
This repository contains the metadata and data of different databases that we use for testing
☆14Updated last year
Alternatives and similar repositories for defog-data
Users who are interested in defog-data are comparing it to the libraries listed below
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆68Updated last year
- ☆141Updated 3 months ago
- ☆82Updated 3 months ago
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …☆45Updated 2 years ago
- Convert natural language query to appropriate SQL, make ERPs cool again.☆74Updated 5 years ago
- Evaluation of bm42 sparse indexing algorithm☆72Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated last year
- ☆36Updated last year
- Code, data, and model of paper "Text-to-SQL Error Correction with Language Models of Code" (ACL'23)☆31Updated last year
- Evaluation tools for Retrieval-augmented Generation (RAG) methods.☆170Updated last year
- ☆43Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆165Updated 3 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆168Updated 2 years ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆84Updated last year
- Model implementation for the contextual embeddings project☆40Updated 8 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 4 months ago
- Introduction page of a challenging text-to-SQL dataset: KaggleDBQA☆42Updated 2 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated last year
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆113Updated last year
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆136Updated last year
- Using Large Language Models (LLMs) to convert natural language queries to sql☆54Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆120Updated 3 months ago
- Code for KaLM-Embedding models☆113Updated 7 months ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker☆126Updated 3 months ago
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆116Updated 6 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆90Updated last month
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆136Updated last year
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆67Updated last year