Obsolete version of CUDA-mode repo -- use cuda-mode/lectures instead
☆28Feb 8, 2024Updated 2 years ago
Alternatives and similar repositories for lecture2
Users that are interested in lecture2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TileGraph is an experimental DNN compiler that utilizes static code generation and kernel fusion techniques.☆11Sep 18, 2024Updated last year
- Persistent Kernel + JIT-Injected Operators (CUDA)☆47Jan 27, 2026Updated 3 months ago
- Fast-track AI apps to production with LLaMA 3, Mistral, and other top LLMs!☆20Jul 12, 2024Updated last year
- Benchmark of common hash functions☆10Sep 15, 2019Updated 6 years ago
- ☆20May 28, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆21Jan 29, 2026Updated 3 months ago
- This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of…☆10Oct 4, 2021Updated 4 years ago
- MobileSAM のエンコーダー/デコーダーをONNXに変換し、推論するサンプル☆12Apr 11, 2024Updated 2 years ago
- Extract streaming data from text using prefix completion.☆10Oct 6, 2024Updated last year
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- Converts CLIP models to ONNX☆11Jan 17, 2023Updated 3 years ago
- 使用ONNXRuntime部署一种用于边缘检测的轻量级密集卷积神经网络LDC,包含C++和Python两个版本的程序☆11Apr 24, 2023Updated 3 years ago
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆11Jan 9, 2024Updated 2 years ago
- ☆10Jul 18, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Bert TensorRT模型加速部署☆10Apr 1, 2022Updated 4 years ago
- No code solution for training tabular models☆35May 12, 2026Updated last week
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU☆12Aug 11, 2025Updated 9 months ago
- This repository provides tutorial, which discusses running sample publisher and subscriber using multiple transports of point_cloud_trans…☆11Mar 17, 2026Updated 2 months ago
- lightNet (Object Detection and Semantic Segmentation) for ONNX and TensorRT☆16Jul 4, 2023Updated 2 years ago
- Stable Diffusion in TensorRT 8.5+☆15Mar 19, 2023Updated 3 years ago
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆11Sep 10, 2024Updated last year
- FastSAM 部署rknn C++ 代码☆13May 30, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- JAX bindings for the flash-attention3 kernels☆22Jan 2, 2026Updated 4 months ago
- ☆178Feb 3, 2024Updated 2 years ago
- CenterNet3D 部署版本,便于移植不同平台(onnx、tensorRT、rknn、Horizon)。☆14May 24, 2024Updated 2 years ago
- snpe tutorial☆10Dec 25, 2023Updated 2 years ago
- Qwen3-VL-2B on the RK3588 NPU☆29Feb 2, 2026Updated 3 months ago
- ☆12Dec 16, 2021Updated 4 years ago
- Example for Logging LLM Evaluator Prompt Responses☆18Aug 14, 2023Updated 2 years ago
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- ☆14Nov 3, 2025Updated 6 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 基于 CUDA Driver API 的 cuda 运行时环境☆16Jul 30, 2025Updated 9 months ago
- ☆13Oct 5, 2023Updated 2 years ago
- Multiple Lidar preprocessor for BEVfusion☆11Aug 25, 2023Updated 2 years ago
- Workshop series teaching the AI Engineering Lifecycle using LangChain, LangGraph, and LangSmith☆33Updated this week
- ☆16Oct 4, 2024Updated last year
- Recording models☆12Sep 19, 2023Updated 2 years ago
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year