RUC-NLPIR / OmniEval
Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"
☆42Updated 3 weeks ago
Alternatives and similar repositories for OmniEval:
Users that are interested in OmniEval are comparing it to the libraries listed below
- ☆17Updated last week
- The demo, code and data of FollowRAG☆68Updated 3 weeks ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆58Updated 2 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆108Updated 6 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆42Updated 6 months ago
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs☆53Updated 2 months ago
- The code and data of DPA-RAG☆54Updated 3 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆64Updated last week
- ☆51Updated 2 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆106Updated 2 months ago
- Code implementation of synthetic continued pretraining☆75Updated this week
- ☆33Updated last month
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆44Updated 2 weeks ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆57Updated 10 months ago
- ☆107Updated 2 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆85Updated 2 months ago
- Towards Systematic Measurement for Long Text Quality☆31Updated 4 months ago
- Reformatted Alignment☆113Updated 3 months ago
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆64Updated last month
- The GitHub repository for the paper "Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning" accepte…☆18Updated 10 months ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆65Updated 5 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆135Updated 4 months ago
- Code for the paper: Metacognitive Retrieval-Augmented Large Language Models☆22Updated 10 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆33Updated 11 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆71Updated 7 months ago
- LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models☆73Updated 2 months ago
- ☆77Updated last year
- ☆38Updated last year
- ☆47Updated 2 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆97Updated 3 months ago