JerryYin777 / PaperHelper
PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant with Reliable References
☆19Updated last year
Alternatives and similar repositories for PaperHelper
Users who are interested in PaperHelper are comparing it to the repositories listed below.
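- An explainer on reproducing DeepSeek-R1, with a roundup of resources☆22Updated 9 months ago
- The simplest reproduction of R1 results on a small model, explaining the core essence shared by O1-like models and DeepSeek R1. Think is all you need. Experiments support the claim that, for strong reasoning ability, the content of the thinking process is the core of AGI/ASI.☆44Updated 10 months ago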
- Manages vllm-nccl dependency☆17Updated last year
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 7 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆102Updated 8 months ago
- Official Implementation of APB (ACL 2025 main Oral)☆32Updated 9 months ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆61Updated last year
- Project code for the final assignment of the graduate course "Theory and Application of Network Big Data Management"☆13Updated 2 years ago
- ☆54Updated 9 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆137Updated last year
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆93Updated 6 months ago
- A High-Efficiency System of Large Language Model Based Search Agents☆74Updated 5 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
- 🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation…☆105Updated last month
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Updated 3 weeks ago
- ☆86Updated 3 months ago
- ☆95Updated last year
- Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.☆19Updated 11 months ago
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra…☆44Updated 2 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆55Updated 6 months ago
- [ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆118Updated 6 months ago
- ☆56Updated 6 months ago
- Code for the paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 5 months ago
- A collection of LLM code written by hand from scratch☆17Updated 8 months ago
- Due to the huge vocabulary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this projec…☆29Updated last year
- Dynamic Context Selection for Efficient Long-Context LLMs☆45Updated 6 months ago
- Implementation for the paper: CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference☆31Updated 9 months ago
- ☆98Updated last week
- Abacus Code LLM (珠算), a large language model for code☆57Updated last year