IAAR-Shanghai / NewsBench
[ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism
☆33 Updated last year
Alternatives and similar repositories for NewsBench
Users who are interested in NewsBench are comparing it to the repositories listed below.
- The demo, code and data of FollowRAG ☆74 Updated 2 months ago
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis' ☆30 Updated 3 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs. ☆42 Updated last year
- The code and data of DPA-RAG, accepted by WWW 2025 main conference. ☆62 Updated 7 months ago
- This is the code repo for the paper "Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning". ☆25 Updated 2 weeks ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆246 Updated 3 weeks ago
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation ☆58 Updated 3 months ago
- ☆83 Updated last year
- ☆58 Updated 10 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations ☆128 Updated 4 months ago
- ☆159 Updated 7 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA. ☆76 Updated this week
- ☆54 Updated 11 months ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis ☆102 Updated 3 months ago
- ☆67 Updated 2 months ago
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew… ☆43 Updated 6 months ago
- Controllable Text Generation for Large Language Models: A Survey ☆185 Updated last year
- ☆134 Updated this week
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning ☆148 Updated 8 months ago
- Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering" ☆68 Updated 3 months ago
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024) ☆71 Updated 4 months ago
- ☆95 Updated 2 weeks ago
- Scaling Preference Data Curation via Human-AI Synergy ☆105 Updated 2 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆173 Updated 2 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning". ☆128 Updated 10 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆134 Updated 11 months ago
- ☆146 Updated last year
- The GitHub repository for the paper "Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning" accepte… ☆19 Updated last year
- Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent" ☆132 Updated last month
- ☆147 Updated 3 months ago