Tutorial for writing custom PyTorch C++ and CUDA kernels, applied to volume rendering (NeRF)
☆29 · Dec 12, 2023 · Updated 2 years ago
Alternatives and similar repositories for pytorch-cppcuda-tutorial
Users that are interested in pytorch-cppcuda-tutorial are comparing it to the libraries listed below
- Theoretical modelling of doping effects and magnetic field effects on the quantum transport in Graphene. ☆14 · Mar 29, 2013 · Updated 12 years ago
- Ultra fast head pose estimation on a bare Raspberry Pi 4 at 20 FPS ☆10 · Dec 21, 2021 · Updated 4 years ago
- ☆30 · Oct 17, 2024 · Updated last year
- ☆10 · Dec 19, 2023 · Updated 2 years ago
- ☆69 · Mar 19, 2023 · Updated 3 years ago
- ☆10 · Aug 31, 2023 · Updated 2 years ago
- Code for ICCV 2023 work "Generalized Few-Shot Point Cloud Segmentation Via Geometric Words" ☆12 · Sep 26, 2023 · Updated 2 years ago
- ☆16 · Feb 21, 2026 · Updated last month
- Perceptron-based branch predictor written in C++ ☆13 · Dec 14, 2016 · Updated 9 years ago
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025) ☆19 · Jan 11, 2026 · Updated 2 months ago
- Official repository for our paper on "Attribution-aware Weight Transfer: A Warm-Start Initialization for Class-Incremental Semantic Segme… ☆12 · Jan 3, 2023 · Updated 3 years ago
- Tutorial for (PyTorch) + (C++) + (Metal shader) ☆16 · Oct 25, 2025 · Updated 4 months ago
- "Write Your Own AI Compiler" ☆34 · Oct 19, 2024 · Updated last year
- ☆15 · Feb 27, 2024 · Updated 2 years ago
- yolov8s-pose inference using ncnn! ☆43 · Apr 27, 2023 · Updated 2 years ago
- A llama model inference framework implemented in CUDA C++ ☆63 · Nov 8, 2024 · Updated last year
- Apply Graph Neural Networks to Optimize Factor Feature Extraction of FactorVAE ☆13 · Jan 11, 2025 · Updated last year
- Implementation of "Learning Fast 3D Gaussian Splatting Rendering using Continuous Level of Detail" presented at Eurographics 2025. ☆27 · Jun 2, 2025 · Updated 9 months ago
- ☆13 · Apr 9, 2024 · Updated last year
- Code for NeurIPS 2021 paper "Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning". ☆16 · Oct 18, 2021 · Updated 4 years ago
- ☆31 · Mar 9, 2026 · Updated last week
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. ☆18 · Feb 9, 2026 · Updated last month
- ☆21 · Feb 14, 2025 · Updated last year
- Virtualized Accelerator Orchestration for Multi-Tenant Workloads ☆20 · Nov 17, 2024 · Updated last year
- A tutorial on the linear scaling quantum transport methods using Jupyter (with Python3) ☆14 · Mar 17, 2019 · Updated 7 years ago
- ☆14 · May 19, 2023 · Updated 2 years ago
- Inference deployment of the llama3 ☆10 · Apr 21, 2024 · Updated last year
- New batched algorithm for sparse matrix-matrix multiplication (SpMM) ☆16 · May 7, 2019 · Updated 6 years ago
- ☆11 · Feb 28, 2023 · Updated 3 years ago
- [TECS'23] A project on the co-design of Accelerators and CNNs. ☆21 · Dec 10, 2022 · Updated 3 years ago
- Code and Solutions from the Self-Driving Car course from Udacity ☆16 · Jun 8, 2017 · Updated 8 years ago
- ☆14 · May 22, 2019 · Updated 6 years ago
- ☆11 · Sep 21, 2022 · Updated 3 years ago
- ☆20 · Feb 12, 2025 · Updated last year
- Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & … ☆55 · Jan 30, 2026 · Updated last month
- Type an M x M matrix for your open quantum system Hamiltonian, and give a spectral density (analytic or numerical). FeynDyn gives the den… ☆24 · Sep 7, 2022 · Updated 3 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 · Aug 12, 2023 · Updated 2 years ago
- Use https://ctags.io instead (This was fork of http://ctags.sourceforge.net/) ☆25 · Jul 25, 2015 · Updated 10 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns" ☆18 · Mar 15, 2024 · Updated 2 years ago