smallporridge / TrustworthyRAGLinks
☆17Updated last year
Alternatives and similar repositories for TrustworthyRAG
Users that are interested in TrustworthyRAG are comparing it to the libraries listed below
Sorting:
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated 9 months ago
- ☆45Updated last week
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆46Updated 8 months ago
- ☆13Updated 8 months ago
- ☆38Updated last month
- ☆22Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆24Updated 2 months ago
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches☆34Updated 2 months ago
- ☆18Updated 2 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆39Updated 3 weeks ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆23Updated 2 months ago
- Codebase for Instruction Following without Instruction Tuning☆35Updated last year
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆26Updated 3 weeks ago
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆19Updated 9 months ago
- ☆21Updated 5 months ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆25Updated last week
- ☆98Updated last month
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents☆44Updated 3 months ago
- ☆36Updated 3 weeks ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆19Updated 10 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated 2 months ago
- This is the code repo for the paper "Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning".☆24Updated last month
- ☆30Updated last month
- ☆23Updated 6 months ago
- ☆16Updated last year
- ☆18Updated 2 months ago
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'☆25Updated 4 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆76Updated 3 weeks ago
- [ACL 2025] Knowledge Unlearning for Large Language Models☆43Updated 3 weeks ago
- The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"☆74Updated last week