REDSearcher: A scalable, cost-efficient framework for long-horizon search agents. Features complex task synthesis, optimized mid-training, and post-training (SFT and Agentic RL).
☆98Feb 26, 2026Updated 2 months ago
Alternatives and similar repositories for REDSearcher
Users who are interested in REDSearcher are comparing it to the libraries listed below.
- ☆47Mar 15, 2025Updated last year
- Working with images in frequency space☆10Nov 5, 2020Updated 5 years ago
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆35Jun 29, 2024Updated last year
- ☆13Jan 22, 2025Updated last year
- Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)☆14Mar 6, 2025Updated last year
- A PyTorch Lightning template to try out a wide range of ideas on the Ubiquant Market Prediction competition without modifying any code!☆12Mar 24, 2022Updated 4 years ago
- ☆39Mar 26, 2026Updated last month
- Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature converg…☆31Oct 10, 2025Updated 6 months ago
- [Arxiv 2025] Official code and datasets of paper: GNNs as Predictors of Agentic Workflow Performances☆20Jan 15, 2026Updated 3 months ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- ☆80Jun 20, 2025Updated 10 months ago
- ☆11Sep 5, 2023Updated 2 years ago
- Adaptive Sparse ViT☆16Aug 1, 2023Updated 2 years ago
- Source code for the paper 'Uncovering Neural Scaling Laws in Molecular Representation Learning' (NeurIPS 2023 Datasets and Benchmarks).☆14Dec 2, 2023Updated 2 years ago
- This is our bachelor's final year project. We try to avoid congestion on two levels, i.e., intersection level and Infrastructure to Vehicl…☆10Oct 12, 2020Updated 5 years ago
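- Course slides, assignments, and exams for Tsinghua University's Introduction to Artificial Intelligence course (taught by Mingsheng Long)☆16Jun 26, 2023Updated 2 years ago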
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆134Jan 31, 2026Updated 3 months ago
- Three experiments for the Pattern Recognition course at USTC, fall 2020☆10Jan 25, 2021Updated 5 years ago
- [CIKM-2024] Official code for work "ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance"☆19Aug 14, 2024Updated last year
- Reinforcement Learning from Hierarchical Critics☆14Jul 30, 2020Updated 5 years ago
- Low Rank Global Attention for Graph Neural Networks☆12Aug 5, 2020Updated 5 years ago
- Universal Association Rule Mining Solver☆15Dec 10, 2025Updated 4 months ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆23Mar 8, 2026Updated 2 months ago
- Solutions to Ireland, Rosen exercises in "A Classical Introduction to Modern Number Theory"☆14Nov 7, 2024Updated last year
- ☆12Jan 3, 2022Updated 4 years ago
- A huge dataset for Document Visual Question Answering☆23Jul 29, 2024Updated last year
- A reinforcement learning course, focused on how to solve problems with reinforcement learning☆15Dec 10, 2024Updated last year
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆23Apr 13, 2026Updated 3 weeks ago
- ☆16Jul 29, 2025Updated 9 months ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DQN)☆13Nov 14, 2019Updated 6 years ago
- Code for "APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training"☆41Dec 23, 2025Updated 4 months ago
- RapidIn: Scalable Influence Estimation for Large Language Models (LLMs). The implementation for paper "Token-wise Influential Training Da…☆21Mar 10, 2026Updated last month
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated last year
- Deep reinforcement learning in autonomous driving☆12Aug 25, 2021Updated 4 years ago
- A simple implementation of ReasonGenRM.☆19Apr 21, 2025Updated last year
- Simple implementations of dilated LSTM, residual LSTM, and attention LSTM (following the corresponding papers).☆17Dec 26, 2019Updated 6 years ago
- A general framework for evaluating the performance of large language models (LLMs) based on a peer-review mechanism among LLMs☆19Aug 3, 2024Updated last year
- ☆14Aug 26, 2018Updated 7 years ago
- [ICLR 2025] Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron☆30Apr 30, 2025Updated last year