Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
☆51Oct 31, 2024Updated last year
Alternatives and similar repositories for LeReT
Users that are interested in LeReT are comparing it to the libraries listed below
Sorting:
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated 10 months ago
- paper on dexpilot☆15Oct 14, 2019Updated 6 years ago
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024"☆13Feb 14, 2025Updated last year
- Code for our paper: "Building A Coding Assistant via Retrieval-Augmented Language Models"☆10Nov 2, 2024Updated last year
- ☆14Apr 16, 2024Updated last year
- A RL env with procedurally generated symbolic reasoning data☆34Updated this week
- ☆13Jul 26, 2023Updated 2 years ago
- Experiments to assess SPADE on different LLM pipelines.☆17Apr 7, 2024Updated last year
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆16Jan 24, 2025Updated last year
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆19May 29, 2023Updated 2 years ago
- TaskMet Task-driven Metric Learning for Model Learning☆20Feb 9, 2024Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- ☆20Oct 12, 2024Updated last year
- ☆19Jan 3, 2025Updated last year
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆23Aug 18, 2024Updated last year
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆224Dec 16, 2025Updated 2 months ago
- ☆105Mar 25, 2025Updated 11 months ago
- Codebase from our first release.☆45Feb 17, 2026Updated 2 weeks ago
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents☆99Feb 2, 2026Updated last month
- CypherBench: Towards Precise Retrieval over Full-scale Modern Knowledge Graphs in the LLM Era☆30Jun 18, 2025Updated 8 months ago
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆32Nov 7, 2024Updated last year
- speed-running solving robot manipulation tasks☆24Oct 31, 2024Updated last year
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆31Aug 18, 2024Updated last year
- Retrieval Augmented Generation Generalized Evaluation Dataset☆61Jul 16, 2025Updated 7 months ago
- MCP server integrating GEPA (Genetic-Evolutionary Prompt Architecture) for automatic prompt optimization with Claude Desktop☆46Nov 10, 2025Updated 3 months ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆22Oct 18, 2023Updated 2 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- Repo for Llatrieval☆31Aug 21, 2024Updated last year
- Plan✕ is a platform for creating and publishing digital planning services☆17Updated this week
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆33May 1, 2025Updated 10 months ago
- Deep Networks Grok All the Time and Here is Why☆38May 18, 2024Updated last year
- [TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Thr…☆34Dec 5, 2025Updated 3 months ago
- Agentic Learning Powered by AWorld☆90Feb 13, 2026Updated 3 weeks ago
- This work has been accepted to Findings of EMNLP 2025!☆48Sep 5, 2025Updated 6 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Jun 10, 2024Updated last year
- ACL 2023☆39Jun 6, 2023Updated 2 years ago
- ☁️ Benchmarking LLMs for Cloud Config Generation | 云场景下的大模型基准测试☆39Oct 25, 2024Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆78Aug 17, 2024Updated last year
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes"☆30Mar 28, 2024Updated last year