tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)
☆29Dec 12, 2023Updated 2 years ago
Alternatives and similar repositories for pytorch-cppcuda-tutorial
Users that are interested in pytorch-cppcuda-tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of our paper published in Springer's Signal, Image and Video Processing☆12Dec 5, 2020Updated 5 years ago
- Ultra fast head pose estimation on a bare Raspberry Pi 4 at 20 FPS☆10Dec 21, 2021Updated 4 years ago
- ☆10Sep 7, 2022Updated 3 years ago
- ☆69Mar 19, 2023Updated 3 years ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆10Dec 19, 2023Updated 2 years ago
- Live demo of hls4ml on embedded platforms such as the Pynq-Z2☆13Aug 23, 2024Updated last year
- ☆34Jul 8, 2025Updated 11 months ago
- Share your GPU without MIG or MPS☆50Jan 27, 2026Updated 4 months ago
- Official repository for our paper on "Attribution-aware Weight Transfer: A Warm-Start Initialization for Class-Incremental Semantic Segme…☆12Jan 3, 2023Updated 3 years ago
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025)☆20Jan 11, 2026Updated 5 months ago
- Fast Neural Network Super-resolution tool based on TensorRT☆15Sep 6, 2025Updated 9 months ago
- ☆27May 27, 2024Updated 2 years ago
- ☆12Nov 23, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Write events for TensorBoard☆13Apr 27, 2026Updated last month
- ☆49Apr 15, 2024Updated 2 years ago
- TensorRT-in-Action 是一个 GitHub 代码库,提供了使用 TensorRT 的代码示例,并有对应 Jupyter Notebook。☆15Jun 1, 2023Updated 3 years ago
- 《自己动手写AI编译器》☆39Oct 19, 2024Updated last year
- ☆15Feb 27, 2024Updated 2 years ago
- MACKO: Sparse matrix vector multiplication for low sparsity☆38Apr 6, 2026Updated 2 months ago
- GAN Step By Step -- GSBS,顾名思义,我希望我自己能够一步一步的学习GAN。GAN 又名 生成对抗网络,是最近几年很热门的一种无监督算法,他能生成出非常逼真的照片,图像甚至视频。GAN是一个图像的全新的领域,从2014的GAN的发展现在,在计算机视觉中…☆11Jan 11, 2023Updated 3 years ago
- 使用 CUDA C++ 实现的 llama 模型推理框架☆65Nov 8, 2024Updated last year
- yolov8s-pose using ncnn inferring!☆44Apr 27, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICCV 2025] Official implementation of X2-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction☆57Oct 27, 2025Updated 7 months ago
- ☆14Jan 14, 2020Updated 6 years ago
- ☆13Apr 9, 2024Updated 2 years ago
- Code for NeurIPS 2021 paper "Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning".☆16Oct 18, 2021Updated 4 years ago
- Examples illustrating usage of the rocBLAS library☆17Aug 12, 2024Updated last year
- Implementation of "Learning Fast 3D Gaussian Splatting Rendering using Continuous Level of Detail" presented at Eurographics 2025.☆30Jun 2, 2025Updated last year
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 4 months ago
- Modules in Matlab to implement the Numerical Renormalization Group technique.☆17Oct 1, 2020Updated 5 years ago
- GEMM☆10Aug 26, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)☆424Apr 17, 2023Updated 3 years ago
- ☆21Apr 19, 2023Updated 3 years ago
- Superresolution running on Rockchip NPU (RK3588, etc..)☆22Jul 7, 2024Updated last year
- Learning Inter-Superpoint Affinity for Weakly Supervised 3D Instance Segmentation☆25Dec 6, 2022Updated 3 years ago
- ☆39Jun 2, 2026Updated last week
- Virtualized Accelerator Orchestration for Multi-Tenant Workloads☆21Nov 17, 2024Updated last year
- New batched algorithm for sparse matrix-matrix multiplication (SpMM)☆16May 7, 2019Updated 7 years ago