Machine Learning Compilation, by Tianqi Chen
☆54Jan 1, 2023Updated 3 years ago
Alternatives and similar repositories for mlc-ai
Users that are interested in mlc-ai are comparing it to the libraries listed below
- ☆18Nov 22, 2025Updated 3 months ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆30Dec 21, 2024Updated last year
- learn TensorRT from scratch🥰☆17Sep 29, 2024Updated last year
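- An easily extensible framework for understanding and optimizing CUDA operators, intended for learning purposes only☆18Jun 13, 2024Updated last year
- Attachment resources for notes on preparing for the new TOEFL☆14Oct 7, 2023Updated 2 years ago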
- ☆12Jul 9, 2021Updated 4 years ago
- This is a repository to practice multi-thread programming in C++☆27Feb 21, 2024Updated 2 years ago
- MXMACA introductory materials☆21Jun 9, 2024Updated last year
- LLM deployment project based on ONNX.☆49Oct 9, 2024Updated last year
- Stable Diffusion using MNN☆66Sep 28, 2023Updated 2 years ago
- Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities☆13Feb 1, 2017Updated 9 years ago
- A collection of cycling-related websites and tools.☆16Mar 12, 2026Updated last week
- C++ and CUDA extensions for Python/Pytorch and GPU Accelerated Augmentation.☆35Nov 30, 2022Updated 3 years ago
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM☆42Oct 20, 2023Updated 2 years ago
- ☆11Aug 2, 2023Updated 2 years ago
- A car re-identification app based on multi-feature fusion technique☆18Apr 24, 2022Updated 3 years ago
- Chinese translation of the CUDA PTX-ISA document☆50Sep 29, 2025Updated 5 months ago
- Learning Matchable Image Transformations☆13Sep 10, 2019Updated 6 years ago
- Fast and memory-efficient exact attention☆20Mar 13, 2026Updated last week
- FP8 flash attention implemented on the Ada architecture using the cutlass repository☆81Aug 12, 2024Updated last year
- Database kernel notes☆13Aug 18, 2022Updated 3 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆18Feb 9, 2026Updated last month
- A light llama-like llm inference framework based on the triton kernel.☆174Jan 5, 2026Updated 2 months ago
- Inference deployment of Llama 3☆10Apr 21, 2024Updated last year
- ☆12Jun 12, 2025Updated 9 months ago
- GEMM☆10Aug 26, 2023Updated 2 years ago
- ☆12Apr 9, 2025Updated 11 months ago
- ☆11Nov 6, 2022Updated 3 years ago
- ⚡️Write HGEMM from scratch using Tensor Cores with WMMA, MMA and CuTe API, Achieve Peak⚡️ Performance.☆149May 10, 2025Updated 10 months ago
- Pilot Behavior Cloning: An imitation learning method for learning tracking skills from human demonstrations.☆19Jan 11, 2025Updated last year
- Lifelong Robotic Vision Website☆30Apr 26, 2024Updated last year
- ☆15Apr 1, 2023Updated 2 years ago
- ☆15Dec 1, 2023Updated 2 years ago
- ☆25Mar 9, 2026Updated last week
- ☆15Jan 27, 2026Updated last month
- ☆13Nov 25, 2019Updated 6 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆11Dec 31, 2024Updated last year
- A simple, fast, minimalistic header-only library for running async tasks and executing task graphs.☆60Nov 29, 2024Updated last year
- A single-file educational implementation for understanding vLLM's core concepts and running LLM inference.☆40Mar 4, 2026Updated 2 weeks ago