cuplv / text-to-sql-wizardcoder
This project applies large language models to text-to-SQL synthesis by fine-tuning WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom Spider training dataset. The resulting model achieves 61% execution accuracy, incorporating database context for validation.
☆43Updated 11 months ago
Related projects
Alternatives and complementary repositories for text-to-sql-wizardcoder
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".☆116Updated 6 months ago
- ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆62Updated 7 months ago
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks☆47Updated 7 months ago
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆63Updated 3 months ago
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding☆48Updated last year
- YuLan-IR: Information Retrieval Boosted LMs☆215Updated 8 months ago
- The prediction results of ChatGPT on various datasets of Text-to-SQL.☆99Updated last year
- NaturalCodeBench (Findings of ACL 2024)☆56Updated last month
- evol augment any dataset online☆55Updated last year
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆47Updated 4 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆70Updated last year
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆33Updated last month
- Code LLMs: data processing for pre-training, fine-tuning, and DPO; a state-of-the-art industry processing pipeline☆27Updated 4 months ago
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"☆97Updated 8 months ago
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆179Updated last month
- This repository contains all the code for the DTS-SQL paper☆41Updated 3 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆133Updated 5 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆38Updated 8 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆124Updated 3 weeks ago
- Unofficial implementation of AlpaGasus☆84Updated last year
- Open Source WizardCoder Dataset☆153Updated last year
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆36Updated 2 weeks ago
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆108Updated 2 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆135Updated 3 months ago