defog-ai / defog-data
This repository contains the metadata and data of the different databases that we use for testing.
☆13 · Updated 2 months ago
Alternatives and similar repositories for defog-data:
Users who are interested in defog-data compare it with the repositories listed below:
- Introduction page of a challenging text-to-SQL dataset: KaggleDBQA ☆35 · Updated last year
- GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training ☆103 · Updated last year
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆128 · Updated last year
- Using Large Language Models (LLMs) to convert natural language queries to SQL ☆43 · Updated 6 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 · Updated 6 months ago
- Convert natural language query to appropriate SQL, make ERPs cool again. ☆73 · Updated 4 years ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆68 · Updated last year
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom … ☆43 · Updated last year
- Translating natural language questions to a structured query language ☆225 · Updated last year
- KitanaQA: Adversarial training and data augmentation for neural question-answering models ☆57 · Updated last year
- Code, data, and model of paper "Text-to-SQL Error Correction with Language Models of Code" (ACL'23) ☆30 · Updated 8 months ago
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands. ☆55 · Updated 11 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use? ☆155 · Updated last year
- Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data ☆102 · Updated 3 years ago
- Code and supplementary materials for a series of Medium articles about the BERT model ☆77 · Updated 2 years ago
- ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor ☆294 · Updated 2 years ago
- This repository contains the relevant materials for the tutorial "Legal IR and NLP: the History, Challenges, and State-of-the-Art", held … ☆41 · Updated 2 years ago
- UNITE: A Unified Benchmark for Text-to-SQL Evaluation ☆71 · Updated 11 months ago
- Finetune mistral-7b-instruct for sentence embeddings ☆81 · Updated 11 months ago
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs" ☆66 · Updated 8 months ago
- A collection of task-specific NLU datasets ☆149 · Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 · Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆76 · Updated 6 months ago
- Semantic Evaluation for Text-to-SQL with Distilled Test Suites ☆267 · Updated 10 months ago
- WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting. ☆40 · Updated 9 months ago