基于 CUDA Driver API 的 cuda 运行时环境
☆14Jul 30, 2025Updated 7 months ago
Alternatives and similar repositories for cuda-driver
Users that are interested in cuda-driver are comparing it to the libraries listed below
Sorting:
- 算子库(Rust)☆14Jul 24, 2025Updated 7 months ago
- 实验:rust 实现 llama2 推理☆17Feb 23, 2024Updated 2 years ago
- 遍历设备树二进制对象☆14Nov 22, 2025Updated 3 months ago
- Graph model execution API for Candle☆17Jul 27, 2025Updated 7 months ago
- ☆125Jan 22, 2026Updated last month
- 分层解耦的深度学习推理引擎☆78Feb 17, 2025Updated last year
- ffmpeg+cuvid+tensorrt+multicamera☆11Dec 31, 2024Updated last year
- 笔记☆52Aug 15, 2025Updated 7 months ago
- YoloV8 segmentation NPU for the RK 3566/68/88☆17Apr 30, 2024Updated last year
- 重构nerf代码,更加容易读懂☆12Mar 26, 2023Updated 2 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆31Apr 2, 2025Updated 11 months ago
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆12Mar 5, 2025Updated last year
- 🎉My Collections of CUDA Kernels~☆10Jun 25, 2024Updated last year
- Hypervisor written in Rust for the RISC-V 1.0 hypervisor extension☆16Oct 21, 2024Updated last year
- a coroutinue lib writen by pure C☆10Feb 24, 2021Updated 5 years ago
- Paging Debug tool for GDB using python☆13Jun 4, 2022Updated 3 years ago
- [WIP] A tiny RISC-V hypervisor software written in Rust☆27Dec 8, 2020Updated 5 years ago
- 在RISC-V处理器上实现一个轻量级的Hypervisor。☆12Dec 25, 2020Updated 5 years ago
- ☆18Jan 4, 2024Updated 2 years ago
- bilibili视频【CUDA 12.x 并行编程入门(C++版)】配套代码☆33Aug 12, 2024Updated last year
- JAX bindings for the flash-attention3 kernels☆21Jan 2, 2026Updated 2 months ago
- 各类内核的设计思路☆19May 19, 2021Updated 4 years ago
- ☆20Sep 28, 2024Updated last year
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆16Sep 15, 2024Updated last year
- FastSAM 部署版本,便于移植不同平,部署简单、运行速度快。☆23May 30, 2024Updated last year
- SGEMM optimization with cuda step by step☆21Mar 23, 2024Updated last year
- Rust powered flash programmer and on chip debugger for embedded devices☆13Dec 1, 2019Updated 6 years ago
- Comparing Rust crate function speeds☆16Jan 11, 2019Updated 7 years ago
- ☆18May 31, 2022Updated 3 years ago
- Coffer is a RISC-V trusted execution environment developed in Rust.☆20Mar 3, 2022Updated 4 years ago
- handle gguf files☆13Aug 14, 2025Updated 7 months ago
- ☆12Apr 19, 2023Updated 2 years ago
- Raw bindings to the BlueZ Linux Bluetooth library for Rust☆16Feb 7, 2023Updated 3 years ago
- 本仓库基于 Intel OpenVINO Toolkit 部署 LightTrack 跟踪算法,包含 Python、C++ 两种语言的推理代码.☆20Nov 2, 2023Updated 2 years ago
- 自嗨虚拟化软件 - 'Enjoy yourself' type-1 hypervisor software☆25Apr 21, 2022Updated 3 years ago
- ☆30Jun 1, 2023Updated 2 years ago
- Harden your Rust with specifications.☆84Mar 10, 2026Updated last week
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆95Feb 20, 2026Updated last month
- RustSBI support on SiFive FU740 board; FU740 is a five-core heterogeneous processor with four SiFive U74 cores, and one SiFive S7 core☆17Jul 20, 2023Updated 2 years ago