Official repo for paper ConvSearch-R1
☆61Nov 4, 2025Updated 7 months ago
Alternatives and similar repositories for ConvSearch-R1
Users who are interested in ConvSearch-R1 are comparing it to the libraries listed below.
- [ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning☆33Jan 4, 2026Updated 5 months ago
- Repo for Llatrieval☆31Aug 21, 2024Updated last year
- ConvGQR: Generative Query Reformulation for Conversational Search. A codebase for ACL 2023 accepted paper.☆35Mar 5, 2024Updated 2 years ago
- ☆23May 21, 2025Updated last year
- UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience☆72Apr 3, 2026Updated 2 months ago
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆30Mar 18, 2026Updated 2 months ago
- KDD 2024 AQA competition 2nd place solution☆12Jul 21, 2024Updated last year
- AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents☆57Jan 28, 2025Updated last year
- ☆27Apr 4, 2026Updated 2 months ago
- Fetch a random wallpaper from Konachan.☆10Jun 4, 2018Updated 8 years ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆37Jan 21, 2025Updated last year
- Dataset for the paper "GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation"☆26Jan 2, 2024Updated 2 years ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Aug 2, 2024Updated last year
- ☆15Jan 23, 2025Updated last year
- ☆12Feb 18, 2025Updated last year
- [ACL'26 Findings] Official code for "BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search"☆30Apr 23, 2026Updated last month
- Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering☆68Mar 4, 2025Updated last year
- The official implementation of Hard Negative Sampling via Large Language Models for Recommendation.☆11Jan 17, 2026Updated 4 months ago
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆35Jul 1, 2024Updated last year
- Dive into LLM Agents☆18Jun 1, 2024Updated 2 years ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆57Aug 28, 2025Updated 9 months ago
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- ☆29Apr 8, 2025Updated last year
- ☆16Nov 25, 2024Updated last year
- A study-room reservation system built with Django☆10Nov 12, 2024Updated last year
- [ICCV 2025] The official PyTorch implementation of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs".☆23Oct 28, 2025Updated 7 months ago
- ☆318Jan 3, 2026Updated 5 months ago
- Official code of "RoboOmni: Proactive Robot Manipulation in Omni-modal Context"☆109Mar 28, 2026Updated 2 months ago
- pdf to markdown with Python3☆11Oct 30, 2019Updated 6 years ago
- [EMNLP 2025] FusionDTI utilises a Token-level Fusion module to effectively learn fine-grained information for Drug-Target Interaction Pr…☆41Mar 6, 2026Updated 3 months ago
- ☆47Apr 20, 2025Updated last year
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆67May 21, 2025Updated last year
- [EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data)☆837Nov 5, 2025Updated 7 months ago
- [ICLR 2026 Oral & ICML 2026] Generative Universal Verifier as Multimodal Meta-Reasoner☆61May 29, 2026Updated last week
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆46Jun 24, 2025Updated 11 months ago
- Data conversion and common data-processing functions needed for some object detection tasks☆15Dec 22, 2022Updated 3 years ago
- ☆11Jun 16, 2024Updated last year
- ☆14Apr 16, 2024Updated 2 years ago
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago