pat-jj / DeepRetrieval
DeepRetrieval - Hacking 🔥 Real Search Engines and Retrievers with LLM via RL
☆487 Updated last week
Alternatives and similar repositories for DeepRetrieval
Users who are interested in DeepRetrieval are comparing it to the libraries listed below.
- adds Sequence Parallelism into LLaMA-Factory ☆482 Updated last week
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework ☆213 Updated last month
- BIRD-CRITIC 1.0: Can Large Language Models Solve USER SQL Issues in Real-World Database Applications? ☆570 Updated last week
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS ☆1,173 Updated last month
- SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing ☆141 Updated last month
- Codebase for Iterative DPO Using Rule-based Rewards ☆243 Updated last month
- The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://a… ☆300 Updated 5 months ago
- In-depth study of the graphrag ☆1,272 Updated last week
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models ☆133 Updated 4 months ago
- ☆45 Updated last month
- Multilingual Corpus of Web Fiction ☆191 Updated 10 months ago
- ☆1,379 Updated 7 months ago
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging ☆135 Updated last month
- Unified KV Cache Compression Methods for Auto-Regressive Models ☆1,051 Updated 4 months ago
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc. ☆164 Updated 6 months ago
- ☆57 Updated 2 months ago
- [ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models ☆43 Updated 3 months ago
- [Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges ☆571 Updated 3 weeks ago
- The official implementation of Self-Play Preference Optimization (SPPO) ☆548 Updated 3 months ago
- R1-like Computer-use Agent ☆69 Updated last month
- Skywork-R1V2:Multimodal Hybrid Reinforcement Learning for Reasoning (Best open-source multimodal reasoning m… ☆2,459 Updated this week
- ☆256 Updated last month
- minimal-cost for training 0.5B R1-Zero ☆716 Updated 2 weeks ago
- "GraphAgent: Agentic Graph Language Assistant"