Ruiyang-061X/Awesome-Search-RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Ruiyang-061X/Awesome-Search-RL)

Ruiyang-061X / Awesome-Search-RL

☆44

Alternatives and similar repositories for Awesome-Search-RL

Users that are interested in Awesome-Search-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Ruiyang-061X / Awesome-MLLM-Reasoning
View on GitHub
📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.
☆13Feb 7, 2025Updated last year
quqxui / MemGAS
View on GitHub
☆18Mar 15, 2026Updated 4 months ago
WxxShirley / Agent-STAR
View on GitHub
Official implementation for paper "Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe"
☆32May 12, 2026Updated 2 months ago
hzy312 / knowledge-r1
View on GitHub
IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent
☆70May 13, 2025Updated last year
Rh-Dang / ECBench
View on GitHub
A Holistic Embodied Cognition Benchmark
☆18Apr 3, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Aofei-Chang / MedHEval
View on GitHub
Repo for preprint 2025 "MedHEval: Benchmarking Hallucinations and Mitigation Strategies in Medical Large Vision-Language Models"
☆16Apr 23, 2025Updated last year
Jack-ZC8 / M3AV-dataset
View on GitHub
[ACL 2024] A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset
☆24May 29, 2025Updated last year
Ruiyang-061X / Uncertainty-o
View on GitHub
✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…
☆21Mar 13, 2025Updated last year
Ruiyang-061X / SketchThinker-R1
View on GitHub
[ICLR'26] SketchThinker-R1: Towards Efficient Sketch-Style Reasoning in Large Multimodal Models
☆17Mar 26, 2026Updated 3 months ago
Zyphra / transformers_zamba2
View on GitHub
☆49Feb 5, 2025Updated last year
WxxShirley / CIKM2023DiRec
View on GitHub
Codes, data, and baselines for CIKM 2023 Long Paper "Dual Intents Graph Modeling for User-centric Group Discovery"
☆17Oct 22, 2023Updated 2 years ago
wizardlancet / diagnosis_zero
View on GitHub
diagnosis_zero, R1 Zero reproduce on disease diagnosis
☆32Jul 24, 2025Updated 11 months ago
ChengpengLi1003 / Awesome-Long-Chain-of-Thought-Reasoning-with-tools
View on GitHub
A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.
☆46Dec 17, 2025Updated 7 months ago
patrick-tssn / Awesome-Multimodal-Memory
View on GitHub
[TMLR 2025] Reading List of Memory Augmented Multimodal Research, including multimodal context modeling, memory in vision and robotics, a…
☆69Jan 17, 2026Updated 6 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
HIT-SCIR / Abacus
View on GitHub
珠算代码大模型（Abacus Code LLM）
☆57Sep 26, 2024Updated last year
NUSTM / LLMs-Waver-In-Judgments
View on GitHub
☆12Sep 23, 2024Updated last year
nju-websoft / HuggingBench
View on GitHub
[SIGIR 2025] Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph
☆16Jun 6, 2025Updated last year
tdlhl / RAD
View on GitHub
[NeurIPS 2025] This is the official repository for "RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis"
☆27Nov 21, 2025Updated 8 months ago
pygongnlp / CoSearchAgent
View on GitHub
[SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models
☆30Feb 15, 2024Updated 2 years ago
Thinklab-SJTU / BiLAF
View on GitHub
Official implementation of Our NeurIPS 2024 Paper "Boundary Matters: A Bi-Level Active Finetuning Method"
☆14Feb 11, 2025Updated last year
WadeYin9712 / UI-Simulator
View on GitHub
Code for 🌍 UI-Simulator: LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training
☆21Oct 17, 2025Updated 9 months ago
disi-unibo-nlp / bio-ee-egv
View on GitHub
[COLING22] Text-to-Text Extraction and Verbalization of Biomedical Event Graphs
☆10Nov 5, 2022Updated 3 years ago
LgQu / TIGeR
View on GitHub
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
0russwest0 / Awesome-Agent-RL
View on GitHub
☆511Oct 11, 2025Updated 9 months ago
guanrenyang / Tiny-TPU
View on GitHub
☆10Dec 15, 2023Updated 2 years ago
Ruiyang-061X / LiSe
View on GitHub
[ECCV'24] Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.
☆40Sep 3, 2024Updated last year
xinli0928 / COVID-Xray
View on GitHub
☆21Oct 10, 2020Updated 5 years ago
ICYPOLE / Fudan-Course-Search
View on GitHub
复旦研究生抢课脚本
☆10Feb 14, 2022Updated 4 years ago
ZhengSaber / Cell-Level-RSRP-Estimation
View on GitHub
Cell-Level RSRP Estimation with the Image-to-Image Wireless Propagation Model Based on Measured data.
☆13Oct 10, 2023Updated 2 years ago
ashish-gehani / Trimmer
View on GitHub
☆13Jul 4, 2024Updated 2 years ago
stephane-caron / matplotlive
View on GitHub
Stream live plots to a matplotlib figure
☆79Jul 3, 2026Updated 2 weeks ago
nji3 / PCA_Autoencoder_FisherFace
View on GitHub
Using PCA, Autoencoder and Fisher linear discriminant to extract the effective representations from the face images. Do the reconstructio…
☆12Apr 23, 2019Updated 7 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
lbhm / fainder
View on GitHub
A fast and accurate index for distribution-aware dataset search.
☆10Feb 3, 2026Updated 5 months ago
yzhang1918 / cikm2022rudi
View on GitHub
Codes and data for CIKM 2022 paper "RuDi: Explaining Behavior Sequence Models by Automatic Statistics Generation and Rule Distillation"
☆12Aug 16, 2022Updated 3 years ago
nju-websoft / One2Branch
View on GitHub
☆10Mar 11, 2024Updated 2 years ago
HC-Guo / Awesome-Multimodal-Chain-of-Thought
View on GitHub
Collection of papers and repos for multimodal chain-of-thought
☆89Nov 6, 2024Updated last year
ispras / dedoc
View on GitHub
Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical …
☆715Updated this week
jingtian11 / EasyOffer
View on GitHub
《EasyOffer》（<大模型面经合集>）是针对LLM宝宝们量身打造的大模型暑期实习Offer指南，主要记录大模型暑期实习和秋招准备的一些常见大厂手撕代码、大厂面经经验、常见大厂思考题等；小白一个，正在学习ing......有问题各位大佬随时指正，希望大家都能拿到心仪Of…
☆810Mar 25, 2025Updated last year
GAIR-NLP / DeepResearcher
View on GitHub
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
☆782May 10, 2026Updated 2 months ago