StibiumT16 / Robust-Fine-tuningLinks
Code for Robust Fine-tuning (RbFT)
☆16Updated last year
Alternatives and similar repositories for Robust-Fine-tuning
Users that are interested in Robust-Fine-tuning are comparing it to the libraries listed below
Sorting:
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- ☆35Updated 3 months ago
- ☆26Updated 8 months ago
- Agentic Learning Powered by AWorld☆80Updated last week
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆49Updated 9 months ago
- ☆13Updated last year
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆23Updated last year
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 6 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆49Updated last year
- ☆96Updated last year
- ☆28Updated last week
- Codebase for Instruction Following without Instruction Tuning☆36Updated last year
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆41Updated 4 months ago
- ☆31Updated last year
- ☆18Updated last year
- Automatic prompt optimization framework for multi-step agent tasks.☆36Updated last year
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆41Updated 5 months ago
- PGRAG☆52Updated last year
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆45Updated 7 months ago
- Code and Data for "FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation" (ACL25)☆28Updated 3 months ago
- ☆36Updated last year
- RuleRAG: Rule Meets Retrieval-Augmented Generation for Question Answering☆32Updated 3 months ago
- FuseAI Project☆87Updated last year
- This is a meta-model distilled from LLMs for information extraction. This is an intermediate checkpoint that can be well-transferred to a…☆28Updated 11 months ago
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆33Updated 9 months ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆22Updated 11 months ago
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆21Updated 8 months ago
- ☆30Updated last year
- ☆62Updated last year
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆53Updated last year