☆31Mar 24, 2025Updated last year
Alternatives and similar repositories for FastTree-Artifact
Users who are interested in FastTree-Artifact are comparing it to the libraries listed below.
- A recommendation model kernel optimizing system☆12Jun 5, 2025Updated last year
- An Optimizing Compiler for Recommendation Model Inference☆26Jun 5, 2025Updated last year
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference☆52Jun 17, 2025Updated last year
- This project contains my personal answers to the MIT 6.5940 course assignments, along with study notes and reflections.☆15Mar 1, 2024Updated 2 years ago
- ☆22Oct 21, 2024Updated last year
- [ICML 2025] Adaptive Self-improvement LLM Agentic System for ML Library Development☆17Jan 6, 2026Updated 5 months ago
- Dynamic Memory Management for Serving LLMs without PagedAttention☆498Jun 10, 2026Updated 3 weeks ago
- ☆32Jul 17, 2024Updated last year
- ☆20Dec 24, 2024Updated last year
- ☆13Mar 18, 2022Updated 4 years ago
- A size profiler for CUDA binaries☆69Jan 15, 2026Updated 5 months ago
- An asynchronous streaming data management module for efficient post-training.☆103Updated this week
- ☆88Apr 18, 2025Updated last year
- Sparse kernels for GNNs based on TVM☆17Nov 18, 2020Updated 5 years ago
- ☆21Jul 24, 2025Updated 11 months ago
- ☆10Apr 10, 2024Updated 2 years ago
- FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swa…☆39Oct 5, 2025Updated 8 months ago
- ☆35Mar 31, 2025Updated last year
- Short RL☆18Apr 16, 2026Updated 2 months ago
- Container-free RL framework for training software engineering agents☆70Jun 24, 2026Updated last week
- Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity☆246Sep 24, 2023Updated 2 years ago
- Github repository for CLAPACK (fork of CLAPACK 3.2.1 patched for our needs)☆10Aug 15, 2018Updated 7 years ago
- ☆106Sep 9, 2024Updated last year
- ☆19Feb 18, 2025Updated last year
- Please visit https://github.com/HKUSTDial/NL2SQL360 to get the official code!☆10Sep 1, 2024Updated last year
- ☆26Oct 9, 2025Updated 8 months ago
- homework in SCUT_SE☆12Nov 9, 2021Updated 4 years ago
- ☆15Dec 5, 2024Updated last year
- Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning, release the dataset and the model weight☆13May 26, 2025Updated last year
- A DAG processor and compiler for a tree-based spatial datapath.☆16Aug 24, 2022Updated 3 years ago
- GBDT-based model with efficient unlearning (SIGMOD 2023)☆10Sep 7, 2025Updated 9 months ago
- ☆47Sep 8, 2025Updated 9 months ago
- A throughput-oriented high-performance serving framework for LLMs☆966Mar 29, 2026Updated 3 months ago
- ☆14Apr 24, 2024Updated 2 years ago
- ☆23Jun 1, 2025Updated last year
- HiAE - A High-Throughput Authenticated Encryption Algorithm for Cross-Platform Efficiency.☆19May 27, 2026Updated last month
- A simple code control system☆13Oct 16, 2021Updated 4 years ago
- Official implementation for "Towards Safe Reinforcement Learning via Constraining Conditional Value at Risk" (IJCAI 2022)☆27Aug 29, 2024Updated last year
- [HPCA 2026] A GPU-optimized system for efficient long-context LLMs decoding with low-bit KV cache.☆96May 14, 2026Updated last month