LightChen233 / AutoPRLinks
This is the official implementation for "AUTOPR: LET'S AUTOMATE YOUR ACADEMIC PROMOTION!".
☆94Updated 3 months ago
Alternatives and similar repositories for AutoPR
Users that are interested in AutoPR are comparing it to the libraries listed below
Sorting:
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems☆74Updated 2 months ago
- ☆32Updated 8 months ago
- ☆87Updated 5 months ago
- ☆54Updated 11 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆68Updated 8 months ago
- ☆46Updated 3 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆84Updated 3 months ago
- Token level visualization tools for large language models☆91Updated last year
- ArxivFlow - Periodic Track on arXiv Paper☆50Updated 4 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆57Updated 8 months ago
- ☆49Updated 5 months ago
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆226Updated 5 months ago
- MrlX: A Multi-Agent Reinforcement Learning Framework☆189Updated 2 weeks ago
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository☆83Updated last week
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆95Updated 8 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆101Updated last year
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Updated 2 months ago
- The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"☆164Updated last month
- ☆216Updated 6 months ago
- ☆192Updated 3 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆306Updated 3 months ago
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆33Updated 10 months ago
- [ICLR 2026] P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark☆47Updated 8 months ago
- ☆82Updated 10 months ago
- Reproducing R1 for Code with Reliable Rewards☆285Updated 9 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆41Updated 9 months ago
- Scaling Long-Horizon LLM Agent via Context-Folding☆106Updated last week
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆94Updated 2 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆95Updated 2 months ago