REDSearcher: A scalable, cost-efficient framework for long-horizon search agents. Features complex task synthesis, optimized mid-training, and post-training (SFT and Agentic RL)
☆86Feb 26, 2026Updated last month
Alternatives and similar repositories for REDSearcher
Users that are interested in REDSearcher are comparing it to the libraries listed below.
- ☆47Mar 15, 2025Updated last year
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆34Jun 29, 2024Updated last year
- Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)☆13Mar 6, 2025Updated last year
- SimKO: Simple Pass@K Policy Optimization☆30Oct 24, 2025Updated 5 months ago
- Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature converg…☆31Oct 10, 2025Updated 6 months ago
- ICML'20: SIGUA: Forgetting May Make Learning with Noisy Labels More Robust☆17Dec 14, 2020Updated 5 years ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆21Mar 8, 2026Updated last month
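- Course slides, assignments, and exam papers for Tsinghua University's Introduction to Artificial Intelligence (taught by Mingsheng Long)☆16Jun 26, 2023Updated 2 years ago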
- Course-enrollment sniping script for Fudan University graduate students☆10Feb 14, 2022Updated 4 years ago
- [CIKM-2024] Official code for work "ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance"☆19Aug 14, 2024Updated last year
- [CVPR 23] Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!☆17May 14, 2024Updated last year
- Solutions to Ireland, Rosen exercises in "A Classical Introduction to Modern Number Theory"☆14Nov 7, 2024Updated last year
- A reinforcement learning course, mainly on how to solve problems with reinforcement learning☆15Dec 10, 2024Updated last year
- A huge dataset for Document Visual Question Answering☆22Jul 29, 2024Updated last year
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆23Updated this week
- ☆16Jul 29, 2025Updated 8 months ago
- A simple implementation of ReasonGenRM.☆19Apr 21, 2025Updated 11 months ago
- Code for "APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training"☆41Dec 23, 2025Updated 3 months ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated last year
- A general framework used on evaluating the performance of large language models (LLMs) based on the peer review mechanism among LLMs☆19Aug 3, 2024Updated last year
- [ICLR 2025] Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron☆30Apr 30, 2025Updated 11 months ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆31Jan 10, 2026Updated 3 months ago
- This repo contains the solutions of UC Berkeley CS 61B spring semester 2018, and materials including slides, lecture codes, exams and dis…☆15May 24, 2024Updated last year
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".☆24Mar 16, 2025Updated last year
- Implementation of Evo-Memory style learning for LLM agents. Agents learn from outcomes, refine strategies, and get smarter with every tas…☆46Dec 3, 2025Updated 4 months ago
- ☆30Dec 29, 2025Updated 3 months ago
- Various commonly used HIT templates☆16Dec 6, 2019Updated 6 years ago
- A Survey of Multimodal Retrieval-Augmented Generation☆20Nov 3, 2025Updated 5 months ago
- CVE-Factory☆83Mar 27, 2026Updated 3 weeks ago
- The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"☆22Apr 22, 2025Updated 11 months ago
- [ICCAD 2025] Squant☆15Jul 3, 2025Updated 9 months ago
- MSTI☆16Mar 6, 2024Updated 2 years ago
- Evaluating GPT-OSS on BrowseComp-Plus with Native Browsing Tools☆19Oct 17, 2025Updated 6 months ago
- Code to generate the Inv3D dataset from our paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document …☆25Mar 6, 2024Updated 2 years ago
- ☆41Dec 7, 2025Updated 4 months ago
- Awesome papers, datasets and projects about the study of large language models like GPT-3, GPT-3.5, ChatGPT, GPT-4, etc.☆21Jun 10, 2023Updated 2 years ago
- ☆42Feb 12, 2026Updated 2 months ago
- GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators☆56Dec 23, 2025Updated 3 months ago