JerryYin777 / PaperHelper
PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant with Reliable References
☆12Updated 7 months ago
Alternatives and similar repositories for PaperHelper:
Users that are interested in PaperHelper are comparing it to the libraries listed below
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Updated 3 months ago
- ☆27Updated last month
- Self Reproduction Code of Paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention (MIT CSAIL)☆12Updated 7 months ago
- Efficient Mixture of Experts for LLM Paper List☆26Updated last month
- survery of small language models☆14Updated 5 months ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆17Updated 2 weeks ago
- Pytorch implementation of our paper accepted by ICML 2024 -- CaM: Cache Merging for Memory-efficient LLMs Inference☆29Updated 7 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆29Updated 7 months ago
- A tiny, didactical implementation of LLAMA 3☆35Updated last month
- ☆37Updated 3 months ago
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆38Updated 10 months ago
- TensorRT LLM Benchmark Configuration☆12Updated 5 months ago
- Triton implement of bi-directional (non-causal) linear attention☆35Updated last week
- Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang☆30Updated 2 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆39Updated 2 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆15Updated last week
- Manages vllm-nccl dependency☆16Updated 7 months ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆28Updated 3 months ago
- ☆38Updated 11 months ago
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆23Updated last month
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆19Updated 8 months ago
- ICDE 2024 Paper, MetaSQL: A Generate-then-Rank Framework for Natural Language to SQL Translation☆17Updated last month
- ☆13Updated last year
- SQUEEZED ATTENTION: Accelerating Long Prompt LLM Inference☆36Updated 2 months ago
- Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".☆44Updated 6 months ago
- Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?☆89Updated 3 months ago
- aigc evals☆10Updated last year
- A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆40Updated 2 months ago
- Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆22Updated 3 months ago
- Open-Pandora: On-the-fly Control Video Generation☆31Updated last month