☆51Mar 4, 2026Updated 3 months ago
Alternatives and similar repositories for triton_course
Users that are interested in triton_course are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A light llama-like llm inference framework based on the triton kernel.☆188Jan 5, 2026Updated 5 months ago
- llm theoretical performance analysis tools and support params, flops, memory and latency analysis.☆120Jul 11, 2025Updated 11 months ago
- 校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。☆548Oct 28, 2025Updated 7 months ago
- CUDA SGEMM optimization note☆15Oct 31, 2023Updated 2 years ago
- A CUDA tutorial to make people learn CUDA program from 0☆279Jul 9, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- some hpc project for learning☆28Aug 28, 2024Updated last year
- UNet-Pruning b developing NNI☆10Sep 2, 2020Updated 5 years ago
- ☆66Apr 26, 2025Updated last year
- my cs notes☆70Oct 14, 2024Updated last year
- 使用 CUDA C++ 实现的 llama 模型推理框架☆65Nov 8, 2024Updated last year
- A selective knowledge distillation algorithm for efficient speculative decoders☆40Nov 27, 2025Updated 6 months ago
- ゼロから作るDeep Learning ❸ をC++で実装する。自習用リポジトリ。☆17Aug 12, 2020Updated 5 years ago
- ☆12May 19, 2022Updated 4 years ago
- 本仓库在OpenVINO推理框架下部署Nanodet检测算法,并重写预处理和后处理部分,具有超高性能!让你在Intel CPU平台上的检测速度起飞! 并基于NNCF和PPQ工具将模型量化(PTQ)至int8精度,推理速度更快!☆16Jun 14, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library st…☆3,435Jun 22, 2025Updated 11 months ago
- ☆23May 29, 2026Updated 2 weeks ago
- CPU Memory Compiler and Parallel programing☆26Nov 18, 2024Updated last year
- Wanwu models release, code will be released soon☆24Aug 24, 2022Updated 3 years ago
- ☆40Updated this week
- ☆12Feb 7, 2018Updated 8 years ago
- ☆15Oct 9, 2022Updated 3 years ago
- This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several…☆1,314Jul 29, 2023Updated 2 years ago
- ☆28Aug 9, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Estimate depth from surface normal.☆12Aug 14, 2020Updated 5 years ago
- vs2013,opencv2.4.9☆10Jul 3, 2017Updated 8 years ago
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆23Apr 13, 2026Updated 2 months ago
- Llama3 Streaming Chat Sample☆22Apr 24, 2024Updated 2 years ago
- ☆29Apr 7, 2025Updated last year
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- Dataset☆23Nov 22, 2024Updated last year
- remote sensing scene classification☆12Mar 1, 2023Updated 3 years ago
- Calibration of depth sensors, e.g. Kinect, Asus Xtion☆13Apr 26, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- cuda编程学习入门☆38Jul 22, 2024Updated last year
- Baseline Code for CVPR 2023 paper. "Multispectral Video Semantic Segmentation: A Benchmark Dataset and Baseline".☆15Sep 21, 2023Updated 2 years ago
- This is an implementation of YOLO using LSQ network quantization method.☆22Apr 13, 2022Updated 4 years ago
- ☆150Jan 9, 2025Updated last year
- A tool convert TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- Clique Percolation Method to extract communities for a graph network. [R implementation]☆24Jun 1, 2021Updated 5 years ago
- ToyLLM: Learning LLM from Scratch☆25Jun 8, 2026Updated last week