Free resource for the book AI Compiler Development Guide
☆50Dec 22, 2022Updated 3 years ago
Alternatives and similar repositories for AI_compiler_development_guide
Users that are interested in AI_compiler_development_guide are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An MLIR-based toy DL compiler for TVM Relay.☆61Oct 16, 2022Updated 3 years ago
- 晚上下班不刷手机,学点什么。系列一:CUDA 计算框架 CUFX (Cuda Framework eXtended)。☆16Dec 15, 2024Updated last year
- General Purpose Graphics Processing Unit (GPGPU) IP Core☆11Jul 4, 2014Updated 11 years ago
- DELTA-pytorch:DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation☆12Apr 16, 2024Updated 2 years ago
- 学生管理系统☆16Feb 3, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- AIDL compiler for C++ on Linux Desktop☆12Aug 17, 2016Updated 9 years ago
- This repository is Onnx tutorial summary for python implements , which comes from other web resource.☆29Oct 23, 2022Updated 3 years ago
- ☆154Mar 18, 2024Updated 2 years ago
- C++ and CUDA extensions for Python/Pytorch and GPU Accelerated Augmentation.☆34Nov 30, 2022Updated 3 years ago
- A Distributed RDF Data Management System for Processing SPARQL Queries Over Distributed RDF Graphs☆14Dec 8, 2022Updated 3 years ago
- Several common methods of matrix multiplication are implemented on CPU and Nvidia GPU using C++11 and CUDA.☆14Feb 8, 2023Updated 3 years ago
- ☆26May 22, 2023Updated 3 years ago
- 🐱 ncnn int8 模型量化评估☆14Oct 10, 2022Updated 3 years ago
- 学生学籍管理系统☆14Apr 16, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Mar 13, 2023Updated 3 years ago
- My study note for mlsys☆14Nov 4, 2024Updated last year
- A curated list of research papers, datasets, and tools for applying machine learning/Deep learning techniques to compilers and program op…☆125Sep 28, 2023Updated 2 years ago
- GEMM by WMMA (tensor core)☆15Jul 31, 2022Updated 3 years ago
- tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)☆29Dec 12, 2023Updated 2 years ago
- ☆12Oct 29, 2020Updated 5 years ago
- An MLIR-based AI compiler designed for Python frontend to RISC-V DSA☆14Oct 10, 2024Updated last year
- ☆20Feb 4, 2021Updated 5 years ago
- a simple x86/arm jit framework for c☆37Mar 2, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- RISC-V CPU Labs in Chisel☆80Jan 31, 2026Updated 3 months ago
- Open source of the paper "击败SOTA反混淆方法"☆16Sep 10, 2022Updated 3 years ago
- how to optimize some algorithm in cuda.☆2,998Updated this week
- Parallel Prefix Sum (Scan) with CUDA☆29Jun 22, 2024Updated last year
- Consuming Resrouce via Auto-generation for LLM-DoS Attack under Black-box Settings☆20Sep 1, 2025Updated 8 months ago
- Embedded Universal DSL: a good DSL for us, by us☆74Updated this week
- ☆48Mar 27, 2023Updated 3 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 3 months ago
- A tiny Debugger : - )☆10Jan 24, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 鉴定网络热门并行编程框架 - 性能测评(附小彭老师锐评)已评测:Taichi、SyCL、C++、OpenMP、TBB、Mojo☆40Aug 28, 2023Updated 2 years ago
- ☆19Apr 28, 2021Updated 5 years ago
- A list of articles outside of the official MLIR docs that I've found useful for learning MLIR☆12Aug 16, 2023Updated 2 years ago
- TileGraph is an experimental DNN compiler that utilizes static code generation and kernel fusion techniques.☆11Sep 18, 2024Updated last year
- A Computational Graph Generator for AI Compiler Fuzzing☆16May 31, 2023Updated 2 years ago
- FFTE: A Fast Fourier Transform Package (Official tarballs are unpacked into master as commits)☆12Feb 17, 2024Updated 2 years ago
- Android yolox hand detect by ncnn☆19Aug 19, 2021Updated 4 years ago