PALIN2018 / BrowseComp-ZH
☆88 Updated 2 months ago
Alternatives and similar repositories for BrowseComp-ZH
Users who are interested in BrowseComp-ZH are comparing it to the repositories listed below.
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis ☆93 Updated 2 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track) ☆91 Updated 5 months ago
- ☆49 Updated last year
- ☆78 Updated last week
- ☆103 Updated 8 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆51 Updated 2 months ago
- ☆144 Updated last year
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆58 Updated 10 months ago
- Code implementation of synthetic continued pretraining ☆123 Updated 7 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios ☆68 Updated 2 months ago
- ☆95 Updated 7 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent ☆62 Updated 2 months ago
- ☆70 Updated 6 months ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar… ☆53 Updated 9 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning ☆147 Updated 7 months ago
- ☆56 Updated 9 months ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee… ☆38 Updated last year
- A Comprehensive Survey on Long Context Language Modeling ☆170 Updated last month
- ☆83 Updated last year
- ☆104 Updated 3 weeks ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆166 Updated last month
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆51 Updated last year
- ☆159 Updated 3 months ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios ☆57 Updated last week
- ☆108 Updated last year
- ☆154 Updated 6 months ago
- ☆36 Updated 11 months ago
- The demo, code and data of FollowRAG ☆74 Updated last month
- Fantastic Data Engineering for Large Language Models ☆89 Updated 7 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution" ☆39 Updated 3 weeks ago