Alibaba-NLP / ZeroSearch
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
☆321Updated this week
Alternatives and similar repositories for ZeroSearch:
Users that are interested in ZeroSearch are comparing it to the libraries listed below
- This repository introduce a comprehensive paper list, datasets, methods and tools for memory research.☆26Updated last week
- ☆91Updated last month
- ☆40Updated this week
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆68Updated 6 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆35Updated 2 months ago
- ☆38Updated 6 months ago
- Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs"☆53Updated 6 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆42Updated 5 months ago
- ☆46Updated last week
- ☆45Updated last month
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆44Updated 4 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆43Updated 2 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆90Updated 2 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆35Updated 7 months ago
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search"☆24Updated this week
- [Preprint] A Generalizable and Purely Unsupervised Self-Training Framework☆56Updated 3 weeks ago
- The demo, code and data of FollowRAG☆72Updated 2 weeks ago
- ☆37Updated 3 weeks ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆51Updated 4 months ago
- ☆40Updated 2 months ago
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆94Updated 3 weeks ago
- ☆40Updated last month
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆141Updated 2 weeks ago
- ☆47Updated 2 months ago
- ☆85Updated 6 months ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"☆49Updated 9 months ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆44Updated 3 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆60Updated 3 months ago
- SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator☆71Updated 4 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆112Updated last week