defog-ai / defog-data
This repository contains the metadata and data of different databases that we use for testing
☆13 · Updated last week
Related projects
Alternatives and complementary repositories for defog-data
- ☆13 · Updated this week
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆122 · Updated 8 months ago
- This is the code for our KILT leaderboard submissions (KGI + Re2G models). ☆149 · Updated last year
- Introduction page of a challenging text-to-SQL dataset: KaggleDBQA ☆34 · Updated last year
- GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training ☆101 · Updated 8 months ago
- Open-Domain Question Answering Goes Conversational via Question Rewriting ☆143 · Updated 2 years ago
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom … ☆43 · Updated 11 months ago
- ☆97 · Updated 2 years ago
- ☆30 · Updated 2 years ago
- ☆37 · Updated last year
- ☆37 · Updated 4 months ago
- Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data ☆98 · Updated 3 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆64 · Updated last month
- A multi-purpose toolkit for table-to-text generation: web interface, Python bindings, CLI commands. ☆54 · Updated 6 months ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models". ☆187 · Updated 3 years ago
- ☆83 · Updated 2 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use? ☆134 · Updated 10 months ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark ☆106 · Updated last month
- This project makes available the code and data from our NAACL paper: "Capturing Row and Column Semantics in Transformer Based Question An… ☆56 · Updated last year
- Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering ☆29 · Updated last year
- Code, data, and model of paper "Text-to-SQL Error Correction with Language Models of Code" (ACL'23) ☆30 · Updated 3 months ago
- ☆15 · Updated 5 months ago
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a… ☆35 · Updated 7 months ago
- A T5 based sequence generation model for WikiSQL task. Achieving 90.3% on test data set using sequence generation. ☆17 · Updated 4 years ago
- PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxi… ☆100 · Updated 10 months ago
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures. ☆31 · Updated 6 months ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering models ☆57 · Updated last year
- ☆131 · Updated last year
- Finetune mistral-7b-instruct for sentence embeddings ☆71 · Updated 6 months ago
- ☆76 · Updated 11 months ago