☆42 Apr 25, 2024 Updated last year
Alternatives and similar repositories for tlp
Users who are interested in tlp are comparing it to the libraries listed below.
- ☆97 Nov 4, 2022 Updated 3 years ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation ☆27 Nov 7, 2019 Updated 6 years ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators ☆123 Oct 26, 2022 Updated 3 years ago
- Optimize tensor program fast with Felix, a gradient descent autotuner. ☆33 Mar 5, 2026 Updated last month
- ☆52 Dec 13, 2022 Updated 3 years ago
- ☆13 Jan 7, 2025 Updated last year
- ☆17 Dec 8, 2023 Updated 2 years ago
- ☆11 Sep 14, 2020 Updated 5 years ago
- [NeurIPS 2024] Search for Efficient LLMs ☆16 Jan 16, 2025 Updated last year
- Heron: Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators ☆23 Jan 30, 2024 Updated 2 years ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling ☆22 Apr 9, 2026 Updated last week
- A home for the final text of all TVM RFCs. ☆109 Sep 24, 2024 Updated last year
- ASPLOS'24: Optimal Kernel Orchestration for Tensor Programs with Korch ☆40 Mar 27, 2025 Updated last year
- SparseTIR: Sparse Tensor Compiler for Deep Learning ☆144 Mar 31, 2023 Updated 3 years ago
- A shader system built using staged metaprogramming ☆15 Jul 9, 2022 Updated 3 years ago
- PIM-ML is a benchmark for training machine learning algorithms on the UPMEM architecture, which is the first publicly-available real-worl… ☆25 Jan 7, 2025 Updated last year
- examples for tvm schedule API ☆101 Jun 12, 2023 Updated 2 years ago
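- Golang version of the WeChat iPad protocol, implemented on top of gRPC. This code requires a gRPC server to pack and unpack packets in order to work properly. ☆13 Jul 8, 2019 Updated 6 years ago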
- Mille Crepe Bench: layer-wise performance analysis for deep learning frameworks. ☆18 Oct 22, 2019 Updated 6 years ago
- A distributed in-memory store for temporal knowledge graphs ☆10 Mar 20, 2024 Updated 2 years ago
- This is a list of awesome edgeAI inference related papers. ☆99 Dec 21, 2023 Updated 2 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup ☆36 Jan 9, 2023 Updated 3 years ago
- Generative Models for Image Captioning ☆10 Jun 7, 2017 Updated 8 years ago
- Multi-branch model for concurrent execution ☆18 Jun 27, 2023 Updated 2 years ago
- Skeletonide is a parallel implementation of Zhang-Suen morphological thinning algorithm written in Halide-lang. Use it for fast skeletoni… ☆14 Oct 21, 2020 Updated 5 years ago
- A Benchmark Toolkit for Assembly Instructions Using the LLVM JIT ☆17 Oct 26, 2020 Updated 5 years ago
- ☆175 Apr 2, 2026 Updated 2 weeks ago
- Automatic Differentiation for Tensor Algebras ☆28 May 8, 2018 Updated 7 years ago
- Horizontal Fusion ☆24 Jan 7, 2022 Updated 4 years ago
- Supplementary materials for our SIGGRAPH 2022 paper ☆28 Apr 28, 2022 Updated 3 years ago
- Alex Graves' Adaptive Computation Time in PyTorch ☆14 Jan 9, 2018 Updated 8 years ago
- A self-contained version of the tutorial which can be easily cloned and viewed by others. ☆24 Jun 24, 2019 Updated 6 years ago
- Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware" ☆29 Feb 18, 2022 Updated 4 years ago
- ☆13 Feb 22, 2023 Updated 3 years ago
- Course project for the open course MIT 6.824 ☆12 Jun 21, 2021 Updated 4 years ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description. ☆1,001 Sep 19, 2024 Updated last year
- ☆16 Oct 3, 2023 Updated 2 years ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/… ☆22 Jul 27, 2023 Updated 2 years ago
- ☆41 Oct 12, 2020 Updated 5 years ago