JerryYin777 / PaperHelperLinks
PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant with Reliable References
☆20Updated last year
Alternatives and similar repositories for PaperHelper
Users that are interested in PaperHelper are comparing it to the libraries listed below
Sorting:
- ☆87Updated 5 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- 最简易的R1结 果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated last year
- ☆106Updated 2 months ago
- Official Implementation of APB (ACL 2025 main Oral) and Spava.☆32Updated last week
- ☆82Updated 10 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 7 months ago
- ☆54Updated 11 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆139Updated last year
- ☆129Updated 8 months ago
- SCOPE: Self-evolving Context Optimization via Prompt Evolution - A framework for automatic prompt optimization☆64Updated last month
- LLMem: GPU Memory Estimation for Fine-Tuning Pre-Trained LLMs☆28Updated 8 months ago
- Implementation for the paper: CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference☆34Updated 11 months ago
- SPEC-RL: Accelerating On-Policy Reinforcement Learning via Speculative Rollouts☆63Updated 2 months ago
- Dynamic Context Selection for Efficient Long-Context LLMs☆55Updated 8 months ago
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆95Updated 8 months ago
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]☆200Updated 6 months ago
- This is the official implementation for "AUTOPR: LET'S AUTOMATE YOUR ACADEMIC PROMOTION!".☆94Updated 3 months ago
- Manages vllm-nccl dependency☆17Updated last year
- ☆74Updated 8 months ago
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"☆67Updated last year
- [ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆121Updated 8 months ago
- ☆64Updated 8 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 9 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆94Updated 2 months ago
- ☆22Updated 11 months ago
- [NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning☆63Updated 3 months ago
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆61Updated last year
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆33Updated 10 months ago
- MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning☆41Updated 5 months ago