GuoqingWang1 / WebFilter
Official code of our AAAI26 paper WebFilter
☆30 · Updated last week
Alternatives and similar repositories for WebFilter
Users who are interested in WebFilter are comparing it to the repositories listed below.
- ☆51 · Updated last year
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025). ☆11 · Updated last month
- ☆98 · Updated 3 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models ☆39 · Updated last year
- ☆50 · Updated 5 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models ☆148 · Updated 4 months ago
- ☆52 · Updated 3 months ago
- ☆36 · Updated last year
- WideSearch: Benchmarking Agentic Broad Info-Seeking ☆101 · Updated last month
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization ☆42 · Updated 8 months ago
- ☆170 · Updated this week
- ☆45 · Updated last month
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets" ☆60 · Updated last year
- Scaling Preference Data Curation via Human-AI Synergy ☆126 · Updated 4 months ago
- Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework ☆29 · Updated 3 months ago
- ☆46 · Updated 5 months ago
- MiroTrain is an efficient and algorithm-first framework for post-training large agentic models. ☆91 · Updated 2 months ago
- [NeurIPS 2024] An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding ☆19 · Updated last year
- ☆38 · Updated 3 months ago
- ☆23 · Updated last year
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning ☆40 · Updated 4 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search ☆62 · Updated 4 months ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs ☆24 · Updated last year
- ☆36 · Updated 4 months ago
- RewardAnything: Generalizable Principle-Following Reward Models ☆44 · Updated 5 months ago
- [ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free" ☆82 · Updated last year
- SSRL: Self-Search Reinforcement Learning ☆151 · Updated 2 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models ☆63 · Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper ☆33 · Updated last year
- ☆95 · Updated 11 months ago