yechens / NL2SQL
Text2SQL 语义解析数据集、解决方案、paper资源整合项目
☆1,244Updated last year
Alternatives and similar repositories for NL2SQL:
Users that are interested in NL2SQL are comparing it to the libraries listed below
- 追一科技首届中文NL2SQL挑战赛决赛第3名方案+代码☆537Updated last month
- 自然语言转SQL,直接连接数据库查询☆371Updated last year
- unified embedding model☆847Updated last year
- A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in T…☆1,550Updated 4 months ago
- NL2SQL competition dataset☆190Updated last year
- 本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。☆518Updated 8 months ago
- 北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研究与应用总结。包括基于知识图谱的问答(KBQA),基于文本的问答系统(TextQA),基于表格的问答系统(TableQA)、基于视觉的问答系统(VisualQA)和机器阅读理解(MRC)等,每类任务分别对…☆1,760Updated last year
- 记录本人整理的一些数据集☆1,021Updated 2 years ago
- An Open-sourced Knowledgable Large Language Model Framework.☆1,260Updated this week
- PaddleNLP UIE模型的PyTorch版实现☆609Updated last year
- The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set defaul ope…☆800Updated 7 months ago
- TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLO…☆946Updated 4 months ago
- scripts and baselines for CSpider: Chinese semantic parsing and text-to-SQL challenge☆174Updated 3 years ago
- 收录NLP竞赛策略实现、各任务baseline、相关竞赛经验贴(当前赛事、往期赛事、训练赛)、NLP会议时间、常用自媒体、GPU推荐等,持续更新中☆2,188Updated last year
- Unified Structure Generation for Universal Information Extraction☆909Updated 2 years ago
- PromptCLUE, 全中文任务支持零样本学习模型☆659Updated last year
- 中文命名实体识别。包含目前最新的中文命名实体识别论文、中文实体识别相关工具、数据集,以及中文预训练模型、词向量、实体识别综述等。☆648Updated 3 weeks ago
- Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.☆2,105Updated this week
- ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SF…☆2,233Updated last year
- 基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等☆2,704Updated last year
- 一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda☆1,794Updated 9 months ago
- 轩辕:度小满中文金融对话大模型☆1,123Updated last week
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆282Updated 5 months ago
- 开源SFT数据集整理,随时补充☆474Updated last year
- Chinese medical dialogue data 中文医疗对话数据集☆1,293Updated last year
- text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。☆4,587Updated 2 weeks ago
- Netease Youdao's open-source embedding and reranker models for RAG products.☆1,571Updated last month
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆982Updated 8 months ago
- 中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆4,050Updated 7 months ago
- Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]☆1,665Updated last year