IBM / sql-rl-gen

SQL-RL-GEN is a reinforcement learning algorithm for the text2SQL generation task, in which a reward function generated by an LLM guides the agent's training.

☆14

Alternatives and similar repositories for sql-rl-gen

Users who are interested in sql-rl-gen are comparing it to the libraries listed below.

dheeraj7596 / Small2Large
☆17 Updated last year
nuochenpku / COMEDY
This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…
☆19 Updated 7 months ago
neulab / data-agora
[arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"
☆33 Updated 6 months ago
MurongYue / LLM_MoT_cascade
This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA…
☆23 Updated last year
castorini / perm-sc
Official codebase for permutation self-consistency.
☆18 Updated last year
THU-KEG / SeaKR
☆28 Updated last year
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54 Updated last year
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆42 Updated last year
orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆44 Updated 11 months ago
LiqiangJing / DSBench
DSBench: How Far are Data Science Agents from Becoming Data Science Experts?
☆55 Updated 4 months ago
amazon-science / synthesizrr
Synthesizing realistic and diverse text-datasets from augmented LLMs
☆12 Updated 2 months ago
xsc1234 / Search-in-the-Chain
Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks
☆57 Updated last year
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆78 Updated 9 months ago
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆34 Updated 9 months ago
OSU-NLP-Group / In-Context-Reranking
[ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"
☆23 Updated 2 months ago
technion-cs-nlp / hallucination-mitigation
☆22 Updated 6 months ago
kaistAI / Janus
[NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages
☆48 Updated 6 months ago
mungg / FABLES
☆57 Updated 9 months ago
yongchao98 / PROMST
Automatic prompt optimization framework for multi-step agent tasks.
☆31 Updated 7 months ago
RUCAIBox / ChainLM
☆29 Updated last year
DAMO-NLP-SG / contrastive-cot
Contrastive Chain-of-Thought Prompting
☆63 Updated last year
gangiswag / cornstack
☆34 Updated last week
awslabs / rag-qa-arena
☆45 Updated 10 months ago
likenneth / persona_drift
Measuring and Controlling Persona Drift in Language Model Dialogs
☆17 Updated last year
gangiswag / llm-reranker
☆46 Updated 5 months ago
NVIDIA / When2Call
A dataset for training and evaluating LLMs on decision making about "when (not) to call" functions
☆23 Updated last month
KwanWaiChung / MT-Eval
Code and data for "MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models"
☆41 Updated 8 months ago
hanqi-qi / Mirror
☆12 Updated last year
LuLuLuyi / LongHeads
LongHeads: Multi-Head Attention is Secretly a Long Context Processor
☆29 Updated last year
OpenMatch / ActiveRAG
This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".
☆107 Updated 8 months ago