Obsolete version of CUDA-mode repo -- use cuda-mode/lectures instead
☆28Feb 8, 2024Updated 2 years ago
Alternatives and similar repositories for lecture2
Users who are interested in lecture2 are comparing it to the libraries listed below.
- TileGraph is an experimental DNN compiler that utilizes static code generation and kernel fusion techniques.☆11Sep 18, 2024Updated last year
- ASCII loop animation of view from the side window of a car☆29Oct 22, 2025Updated 6 months ago
- This repository contains a C implementation of matrix multiplication with various optimization techniques.☆15Jun 7, 2025Updated 10 months ago
- This project is intended to build and deploy on Qualcomm devices an SNPE model that has unsupported layers which are not part of…☆10Oct 4, 2021Updated 4 years ago
- Extract streaming data from text using prefix completion.☆10Oct 6, 2024Updated last year
- Mixture of Experts (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- ☆12Nov 5, 2024Updated last year
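- YOLOv12 TensorRT end-to-end model inference acceleration and INT8 quantization implementation☆13Mar 5, 2025Updated last year
- Python ONNX inference sample for LDC: Lightweight Dense CNN for Edge Detection☆15May 6, 2023Updated 2 years ago
- DETR tensor: removes the unused auxiliary heads from inference, adds fp16 deployment for a further speedup, and offers a new method for fixing all-zero outputs after conversion to tensorrt.☆11Jan 9, 2024Updated 2 years ago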
- ☆10Jul 18, 2024Updated last year
- No code solution for training tabular models☆35Apr 15, 2026Updated 2 weeks ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX, and exporting the ONNX model for AXera's NPU☆12Aug 11, 2025Updated 8 months ago
- This repository provides a tutorial that discusses running a sample publisher and subscriber using multiple transports of point_cloud_trans…☆11Mar 17, 2026Updated last month
- lightNet (Object Detection and Semantic Segmentation) for ONNX and TensorRT☆16Jul 4, 2023Updated 2 years ago
- Stable Diffusion in TensorRT 8.5+☆15Mar 19, 2023Updated 3 years ago
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆11Sep 10, 2024Updated last year
- C++ code for deploying FastSAM on rknn☆13May 30, 2024Updated last year
- BPE tokenization implemented in Golang 💙☆11Oct 2, 2023Updated 2 years ago
- Vulnerable demo application for the race condition☆22Apr 27, 2021Updated 5 years ago
- A math resource for CS students (I have decided to refactor the contents to my personal blog and continue working on this, so the project …☆21Nov 27, 2024Updated last year
- YoloV8 segmentation NPU for the RK 3566/68/88☆18Apr 30, 2024Updated 2 years ago
- Refactored nerf code that is easier to read☆13Mar 26, 2023Updated 3 years ago
- ☆178Feb 3, 2024Updated 2 years ago
- Deployment version of CenterNet3D, easy to port to different platforms (onnx, tensorRT, rknn, Horizon).☆13May 24, 2024Updated last year
- snpe tutorial☆10Dec 25, 2023Updated 2 years ago
- Multivariate Time Series Data usable for Time Series Segmentation and Time Series Classification. Each sample represents the multi-phased…☆11Apr 20, 2024Updated 2 years ago
- ☆12Dec 16, 2021Updated 4 years ago
- Example for Logging LLM Evaluator Prompt Responses☆18Aug 14, 2023Updated 2 years ago
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- ☆14Nov 3, 2025Updated 5 months ago
- A cuda runtime environment based on the CUDA Driver API☆16Jul 30, 2025Updated 9 months ago
- Multiple Lidar preprocessor for BEVfusion☆11Aug 25, 2023Updated 2 years ago
- Material for gpu-mode lectures☆6,012Apr 22, 2026Updated last week
- Workshop series teaching the AI Engineering Lifecycle using LangChain, LangGraph, and LangSmith☆31Updated this week
- ☆16Oct 4, 2024Updated last year
- High-performance RMSNorm implementation using SM core storage (registers and shared memory)☆30Jan 22, 2026Updated 3 months ago
- Recording models☆12Sep 19, 2023Updated 2 years ago