JerryYin777 / PaperHelper
PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant with Reliable References
☆16 Updated last year
Alternatives and similar repositories for PaperHelper
Users who are interested in PaperHelper are comparing it to the repositories listed below.
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 Updated 8 months ago
- ☆20 Updated 3 weeks ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models ☆46 Updated 3 weeks ago
- Official Implementation of APB (ACL 2025 main) ☆28 Updated 4 months ago
- Dynamic Context Selection for Efficient Long-Context LLMs ☆33 Updated last month
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction. ☆46 Updated 8 months ago
- Survey of small language models ☆15 Updated 11 months ago
- The code for "AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference", Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Ch… ☆18 Updated last month
- ☆29 Updated 2 months ago
- Implementation for the paper: CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference ☆21 Updated 3 months ago
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models" ☆52 Updated 10 months ago
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts ☆40 Updated last year
- ☆28 Updated 4 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper ☆32 Updated last year
- PyTorch implementation of our paper accepted by ICML 2024 -- CaM: Cache Merging for Memory-efficient LLMs Inference ☆39 Updated last year
- ☆57 Updated 3 weeks ago
- Code for Robust Fine-tuning (RbFT) ☆12 Updated 4 months ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton ☆28 Updated 4 months ago
- ☆22 Updated 11 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization ☆37 Updated 4 months ago
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning ☆11 Updated this week
- The official repository for the Scientific Paper Idea Proposer (SciPIP) ☆62 Updated 4 months ago
- SQUEEZED ATTENTION: Accelerating Long Prompt LLM Inference ☆47 Updated 7 months ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset. ☆14 Updated 4 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity". ☆23 Updated 7 months ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models" ☆53 Updated 11 months ago
- ☆58 Updated last week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators ☆19 Updated this week
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection ☆45 Updated 7 months ago
- ☆53 Updated last week