Free resource for the book AI Compiler Development Guide
☆49Dec 22, 2022Updated 3 years ago
Alternatives and similar repositories for AI_compiler_development_guide
Users that are interested in AI_compiler_development_guide are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Start AI Compiler☆47Feb 26, 2026Updated last month
- ☆10Sep 7, 2023Updated 2 years ago
- blogs about Coimpiler & Virtual Machine☆12Jun 15, 2025Updated 9 months ago
- An MLIR-based toy DL compiler for TVM Relay.☆61Oct 16, 2022Updated 3 years ago
- a simple general program language☆99Feb 2, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 晚上下班不刷手机,学点什么。系列一:CUDA 计算框架 CUFX (Cuda Framework eXtended)。☆16Dec 15, 2024Updated last year
- General Purpose Graphics Processing Unit (GPGPU) IP Core☆11Jul 4, 2014Updated 11 years ago
- ☆15Jan 11, 2023Updated 3 years ago
- CVA6-platform is a multicore CVA6 with CV-MESH software and regression platform☆13Nov 12, 2023Updated 2 years ago
- DELTA-pytorch:DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation☆12Apr 16, 2024Updated last year
- 学生管理系统☆17Feb 3, 2020Updated 6 years ago
- AIDL compiler for C++ on Linux Desktop☆12Aug 17, 2016Updated 9 years ago
- This repository is Onnx tutorial summary for python implements , which comes from other web resource.☆29Oct 23, 2022Updated 3 years ago
- ☆152Mar 18, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆11Nov 25, 2020Updated 5 years ago
- C++ and CUDA extensions for Python/Pytorch and GPU Accelerated Augmentation.☆35Nov 30, 2022Updated 3 years ago
- IMPORTANT NOTICE: This implementation is long outdated. Whole-Function Vectorization is an algorithm that transforms a scalar function in…☆22May 16, 2012Updated 13 years ago
- ☆26May 22, 2023Updated 2 years ago
- 学生学籍管理系统☆14Apr 16, 2018Updated 7 years ago
- A practical way of learning Swizzle☆37Feb 3, 2025Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Mar 13, 2023Updated 3 years ago
- A curated list of research papers, datasets, and tools for applying machine learning/Deep learning techniques to compilers and program op…☆124Sep 28, 2023Updated 2 years ago
- My study note for mlsys☆14Nov 4, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- GEMM by WMMA (tensor core)☆15Jul 31, 2022Updated 3 years ago
- tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)☆29Dec 12, 2023Updated 2 years ago
- Hands-On Practical MLIR Tutorial☆748Oct 20, 2023Updated 2 years ago
- play gemm with tvm☆91Jul 22, 2023Updated 2 years ago
- ☆12Oct 29, 2020Updated 5 years ago
- The source code the for the ICLR'24 paper "Stabilizing Backpropagation Through Time to Learn Complex Physics"☆11May 17, 2024Updated last year
- an optimizing compiler to a binary turing machine☆12Dec 16, 2024Updated last year
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆82Aug 12, 2024Updated last year
- Introduction about SIMD instructions. Mainly about SSE and AVX.☆13Mar 13, 2018Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- RISC-V CPU Labs in Chisel☆78Jan 31, 2026Updated 2 months ago
- Open source of the paper "击败SOTA反混淆方法"☆18Sep 10, 2022Updated 3 years ago
- how to optimize some algorithm in cuda.☆2,910Apr 1, 2026Updated last week
- Consuming Resrouce via Auto-generation for LLM-DoS Attack under Black-box Settings☆19Sep 1, 2025Updated 7 months ago
- Embedded Universal DSL: a good DSL for us, by us☆73Updated this week
- ☆48Mar 27, 2023Updated 3 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 2 months ago