cuplv / text-to-sql-wizardcoder
This project applies large language models to text-to-SQL synthesis by fine-tuning WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom Spider training dataset. The resulting model achieves 61% execution accuracy, incorporating database context for validation.
☆43 · Updated 9 months ago
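As a rough illustration of what an execution-accuracy figure measures, the sketch below runs each predicted query and its reference query against a database and counts how often the returned rows match. It is a minimal sketch and not this project's evaluation code: the helper names (`run_query`, `execution_accuracy`), the toy `singer` table, and the order-insensitive row comparison are all assumptions made for illustration.

```python
import sqlite3
from collections import Counter


def run_query(conn, sql):
    """Execute sql and return its rows as a multiset, or None if it fails."""
    try:
        return Counter(conn.execute(sql).fetchall())
    except sqlite3.Error:
        return None


def execution_accuracy(conn, pairs):
    """Fraction of (predicted, gold) SQL pairs that return the same rows.

    Assumption: rows are compared as multisets, so row order is ignored.
    """
    if not pairs:
        return 0.0
    correct = 0
    for predicted, gold in pairs:
        gold_rows = run_query(conn, gold)
        pred_rows = run_query(conn, predicted)
        if gold_rows is not None and pred_rows == gold_rows:
            correct += 1
    return correct / len(pairs)


# Hypothetical toy database, used only to exercise the functions above.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE singer (id INTEGER, name TEXT, age INTEGER)")
conn.executemany(
    "INSERT INTO singer VALUES (?, ?, ?)",
    [(1, "Ann", 30), (2, "Bob", 45)],
)

pairs = [
    # Different SQL text, same result rows: counted as correct.
    ("SELECT name FROM singer WHERE age > 40",
     "SELECT name FROM singer WHERE age >= 41"),
    # Misspelled column: the predicted query fails, counted as incorrect.
    ("SELECT nam FROM singer", "SELECT name FROM singer"),
]
score = execution_accuracy(conn, pairs)
```

Comparing result rows instead of SQL text means that differently written but equivalent queries are scored as correct, which is the motivation for execution-based metrics.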
Related projects:
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs" ☆58 · Updated last month
- YuLan-IR: Information Retrieval Boosted LMs ☆211 · Updated 6 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models. ☆27 · Updated 2 months ago
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables". ☆104 · Updated 4 months ago
- The prediction results of ChatGPT on various datasets of Text-to-SQL. ☆98 · Updated last year
- ☆124 · Updated 2 months ago
- Using Large Language Models (LLMs) to convert natural language queries to sql ☆36 · Updated last year
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding ☆47 · Updated last year
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation" ☆95 · Updated last month
- ☆26 · Updated 10 months ago
- evol augment any dataset online ☆55 · Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" ☆106 · Updated this week
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆128 · Updated 2 months ago
- ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios ☆63 · Updated 5 months ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark ☆98 · Updated 2 weeks ago
- Evaluation tools for Retrieval-augmented Generation (RAG) methods. ☆112 · Updated 2 months ago
- Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering" ☆27 · Updated 3 weeks ago
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data" ☆95 · Updated 6 months ago
- ☆52 · Updated last month
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd… ☆39 · Updated 2 months ago
- The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey. ☆71 · Updated 2 months ago
- TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios ☆122 · Updated last month
- ☆44 · Updated 2 years ago
- Unofficial implementation of AlpaGasus ☆83 · Updated 11 months ago
- ☆32 · Updated 2 weeks ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆101 · Updated last week
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks ☆47 · Updated 5 months ago
- ☆131 · Updated last year
- ☆111 · Updated 6 months ago
- Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase… ☆49 · Updated last year