☆29Mar 24, 2025Updated 11 months ago
Alternatives and similar repositories for FastTree-Artifact
Users that are interested in FastTree-Artifact are comparing it to the libraries listed below
Sorting:
- A recommendation model kernel optimizing system☆12Jun 5, 2025Updated 9 months ago
- An Optimizing Compiler for Recommendation Model Inference☆26Jun 5, 2025Updated 9 months ago
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference☆50Jun 17, 2025Updated 9 months ago
- ☆21Oct 21, 2024Updated last year
- A Parallel Secure Machine Learning Framework on GPUs☆21Nov 17, 2021Updated 4 years ago
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025)☆15Jan 6, 2026Updated 2 months ago
- ☆15Jan 7, 2022Updated 4 years ago
- Dynamic Memory Management for Serving LLMs without PagedAttention☆466May 30, 2025Updated 9 months ago
- ☆33Jul 17, 2024Updated last year
- ☆19Dec 24, 2024Updated last year
- a size profiler for cuda binary☆72Jan 15, 2026Updated 2 months ago
- ☆85Apr 18, 2025Updated 11 months ago
- Sparse kernels for GNNs based on TVM☆17Nov 18, 2020Updated 5 years ago
- ☆21Jul 24, 2025Updated 7 months ago
- ☆11Apr 10, 2024Updated last year
- FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swa…☆39Oct 5, 2025Updated 5 months ago
- ☆33Mar 31, 2025Updated 11 months ago
- Short RL☆18May 26, 2025Updated 9 months ago
- Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity☆237Sep 24, 2023Updated 2 years ago
- ☆25Oct 9, 2025Updated 5 months ago
- ☆19Feb 18, 2025Updated last year
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆12Nov 8, 2024Updated last year
- ☆63May 16, 2025Updated 10 months ago
- Official implementation of paper "HiAE: A High-Throughput Authenticated Encryption Algorithm for Cross-Platfor Efficiency"☆19Nov 11, 2025Updated 4 months ago
- B站爬虫☆15Dec 10, 2023Updated 2 years ago
- A book about Ph.D. student and research career planning☆29Oct 21, 2025Updated 5 months ago
- Samoyeds: Accelerating MoE Models with Structured Sparsity Leveraging Sparse Tensor Cores (EuroSys'25)☆15Jul 17, 2025Updated 8 months ago
- ☆14Dec 5, 2024Updated last year
- A DAG processor and compiler for a tree-based spatial datapath.