Rebecia / OSS-FGI
☆16Updated 10 months ago
Alternatives and similar repositories for OSS-FGI
Users who are interested in OSS-FGI are comparing it to the repositories listed below
- Notes and standard interview questions from self-studying LLMs☆18Updated 10 months ago
- 📰 Must-read papers and blogs on Speculative Decoding ⚡️☆1,117Updated 2 weeks ago
- Awesome list for LLM pruning.☆281Updated 3 months ago
- [TMLR 2024] Efficient Large Language Models: A Survey☆1,253Updated 7 months ago
- ☆92Updated 6 months ago
- Open-source code and data for ShadowNet (S&P Oakland'23)☆11Updated last year
- Paper list for Efficient Reasoning.☆822Updated last week
- Curated collection of papers in MoE model inference☆341Updated 3 months ago
- ☆14Updated last year
- Using LLM to evaluate MMLU dataset.☆42Updated last year
- Reading notes on Speculative Decoding papers☆21Updated 2 months ago
- All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai☆190Updated 2 years ago
- Fast inference from large language models via speculative decoding☆886Updated last year
- 🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasonin…☆65Updated 8 months ago
- A collection list for Large Language Model (LLM) Watermark☆57Updated this week
- My implementation of Stanford CS336 assignments.☆227Updated 6 months ago
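- [2024 edition] Lab code for the Intelligent Computing Systems (AICS) course by Chen Yunji at UCAS (University of Chinese Academy of Sciences)☆491Updated 7 months ago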
- TSQP: Safeguarding Real-Time Inference for Quantization Neural Networks on Edge Devices (Accepted to S&P 2025)☆17Updated 4 months ago
- ☆580Updated 7 months ago
- ☆34Updated 10 months ago
- ☆22Updated last year
- Revisit Kernel Pruning with Lottery Regulated Grouped Convolutions. ICLR 2022☆11Updated 3 years ago
- An interactive attention visualization and intervention tool for LLM Decode Stage.☆43Updated last month
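- Collected course materials and assignments for the Key Software track at Peking University's School of Software and Microelectronics (Operating Systems and Virtualization, Deep Learning Technology and Applications, etc.)☆34Updated last year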
- Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.☆411Updated 11 months ago
- Official implementation for "HuRef: HUman-REadable Fingerprint for Large Language Models" (NeurIPS2024)☆15Updated 7 months ago
- A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).☆1,847Updated 2 weeks ago
- ☆26Updated last year
- List of papers related to neural network quantization in recent AI conferences and journals.☆795Updated 10 months ago
- This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…☆281Updated 2 months ago