JerryYin777 / PaperHelper
PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant with Reliable References
☆16Updated 11 months ago
Alternatives and similar repositories for PaperHelper
Users who are interested in PaperHelper are comparing it to the repositories listed below.
- Implementation for the paper: CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference☆20Updated 3 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Updated 7 months ago
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆46Updated 7 months ago
- A High-Efficiency System of Large Language Model Based Search Agents☆41Updated last week
- Survey of small language models☆15Updated 10 months ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆18Updated last week
- ☆42Updated 3 months ago
- ☆20Updated last week
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆46Updated last week
- [DAC'25] Official implementation of "HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference"☆51Updated 2 weeks ago
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆39Updated last year
- ☆14Updated last year
- Tutorial for Ray☆25Updated last year
- Official Implementation of APB (ACL 2025 main)☆28Updated 3 months ago
- Code for Robust Fine-tuning (RbFT)☆12Updated 4 months ago
- Long-short token decoding with a 4x speedup for long-context LLMs. About a hundred lines of core code. Open source for learning.☆8Updated 10 months ago
- diagnosis_zero: R1-Zero reproduction on disease diagnosis☆29Updated 3 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆22Updated 6 months ago
- ☆49Updated 3 weeks ago
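- The simplest reproduction of R1 results on a small model, explaining the most important essence of O1-like models and DeepSeek R1. Think is all you need. Experiments support the claim that, for strong reasoning ability, the content of the thinking process is the core of AGI/ASI.☆45Updated 3 months ago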
- ☆33Updated last month
- [KDD 2025] AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning☆11Updated last week
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆20Updated last week
- LLMem: GPU Memory Estimation for Fine-Tuning Pre-Trained LLMs☆21Updated last week
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆19Updated 2 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- ☆45Updated 3 months ago
- Summary of system papers/frameworks/code/tools on training or serving large models☆57Updated last year
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆13Updated 3 months ago
- DeepSeek-R1 reproduction: an introductory guide and resource collection☆21Updated 3 months ago