IBM / sql-rl-gen
The SQL-RL-GEN is an algorithm based on a Reinforcement Learning approach with a reward function generated by a LLM to guide the agent's training process in solving the specific text2SQL generation task.
☆12Updated last week
Alternatives and similar repositories for sql-rl-gen
Users that are interested in sql-rl-gen are comparing it to the libraries listed below
Sorting:
- ☆45Updated 9 months ago
- ☆45Updated last month
- Codebase accompanying the Summary of a Haystack paper.☆78Updated 7 months ago
- Verifiers for LLM Reinforcement Learning☆50Updated last month
- ☆29Updated 6 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 5 months ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆19Updated 6 months ago
- Official codebase for permutation self-consistency.☆18Updated last year
- [EMNLP 2024] TRACE the Evidence: Constructing Knowledge-Grounded Reasoning Chains for Retrieval-Augmented Generation☆26Updated last month
- ☆57Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 8 months ago
- ☆17Updated 3 weeks ago
- Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models☆21Updated 5 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆44Updated 10 months ago
- Codebase for Instruction Following without Instruction Tuning☆34Updated 7 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- Code and Data for "Language Modeling with Editable External Knowledge"☆32Updated 10 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆54Updated 7 months ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Updated last year
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆24Updated 3 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆52Updated 2 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- ☆42Updated last month
- ☆43Updated 3 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆25Updated 5 months ago
- ☆45Updated 7 months ago
- ☆47Updated 11 months ago
- ☆22Updated 5 months ago
- ☆22Updated 5 months ago
- ☆11Updated last month