HackerHyper / ACMVHLinks
Adaptive Confidence Multi-View Hashing
☆23Updated last year
Alternatives and similar repositories for ACMVH
Users that are interested in ACMVH are comparing it to the libraries listed below
Sorting:
- CLIPMH:CLIP Multi-modal Hashing☆40Updated 10 months ago
- A high-performance distributed deep learning system targeting large-scale and automated distributed training.☆320Updated last month
- ☆608Updated 3 months ago
- Curated collection of papers in MoE model inference☆250Updated last month
- [ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo☆37Updated last month
- [TKDE'25] The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".☆412Updated last month
- a curated list of high-quality papers on resource-efficient LLMs 🌱☆134Updated 5 months ago
- [TMLR 2024] Efficient Large Language Models: A Survey☆1,209Updated 2 months ago
- This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…☆192Updated last month
- Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline mod…☆546Updated 11 months ago
- Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.☆356Updated 6 months ago
- 📰 Must-read papers on KV Cache Compression (constantly updating 🤗).☆525Updated last month
- A large-scale simulation framework for LLM inference☆428Updated last month
- ☆46Updated 3 years ago
- This is the source code of our ICML25 paper, titled "Accelerating Large Language Model Reasoning via Speculative Search".☆20Updated 3 months ago
- A reading list for deep graph learning acceleration.☆249Updated last month
- Survey Paper List - Efficient LLM and Foundation Models☆255Updated 11 months ago
- Towards Generalized and Efficient Blackbox Optimization System/Package (KDD 2021 & JMLR 2024)☆424Updated 2 weeks ago
- Repo for SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting (ISCA25)☆48Updated 4 months ago
- [Mlsys'22] Understanding gnn computational graph: A coordinated computation, io, and memory perspective☆20Updated last year
- ATC23 AE☆46Updated 2 years ago
- PyTorch implementation of paper "Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline".☆90Updated 2 years ago
- 📰 Must-read papers and blogs on Speculative Decoding ⚡️☆906Updated this week
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)☆152Updated last year
- LLM Inference analyzer for different hardware platforms☆87Updated last month
- ☆100Updated last year
- A curated list of awesome projects and papers for distributed training or inference☆241Updated 10 months ago
- Awesome list for LLM pruning.☆256Updated this week
- ☆333Updated last year
- AI and Memory Wall☆220Updated last year