tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)
☆29Dec 12, 2023Updated 2 years ago
Alternatives and similar repositories for pytorch-cppcuda-tutorial
Users that are interested in pytorch-cppcuda-tutorial are comparing it to the libraries listed below
Sorting:
- ☆30Oct 17, 2024Updated last year
- ☆32Jul 8, 2025Updated 7 months ago
- All Resources from Stanford CS106B 2021☆23Jul 11, 2025Updated 7 months ago
- HealthiVert-GAN, a novel deep-learning framework designed to generate pseudo-healthy vertebral images. These images simulate the pre-frac…☆11Nov 3, 2025Updated 3 months ago
- Tiny C++ LLM inference implementation from scratch☆103Jan 29, 2026Updated last month
- Use yolov5 to realize the road occupation operation and vehicle parking violation detection in urban streets, and can independently delin…☆12Jan 2, 2023Updated 3 years ago
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025)☆18Jan 11, 2026Updated last month
- GAN Step By Step -- GSBS,顾名思义,我希望我自己能够一步一步的学习GAN。GAN 又名 生成对抗网络,是最近几年很热门的一种无监督算法,他能生成出非常逼真的照片,图像甚至视频。GAN是一个图像的全新的领域,从2014的GAN的发展现在,在计算机视觉中…☆11Jan 11, 2023Updated 3 years ago
- ☆10Dec 19, 2023Updated 2 years ago
- amdgpu example code in hip/asm☆55Feb 16, 2026Updated last week
- ☆49Apr 15, 2024Updated last year
- 天池学习赛——街景字符编码识别☆16Apr 5, 2021Updated 4 years ago
- yolov8s-pose using ncnn inferring!☆44Apr 27, 2023Updated 2 years ago
- 使用 cutlass 实现 flash-attention 精简版,具有教学意义☆57Aug 12, 2024Updated last year
- ☆47Nov 5, 2025Updated 3 months ago
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆29Updated this week
- A simplified implementation inspired by Cline☆10Mar 11, 2025Updated 11 months ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- open-source first release (OpenCV, Deepface, YOLOv8, Roboflow)☆13Jan 2, 2025Updated last year
- GEMM☆10Aug 26, 2023Updated 2 years ago
- Web app for makeup transfer using Stable Diffusion☆10Sep 11, 2023Updated 2 years ago
- Write events for TensorBoard☆11Jun 27, 2024Updated last year
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Jan 7, 2020Updated 6 years ago
- Radiology Object in COntext version 2☆18Nov 13, 2024Updated last year
- Inference deployment of the llama3☆11Apr 21, 2024Updated last year
- gradio bbox labeling tools☆11May 12, 2023Updated 2 years ago
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- ☆11Sep 21, 2022Updated 3 years ago
- This code is for converting COCO json annotations to YOLO txt format (which both are common in object detection projects).☆10Feb 19, 2024Updated 2 years ago
- ☆12Nov 21, 2023Updated 2 years ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- This repository is for the Computer Vision Nano-degree Program from Udacity.☆11Aug 30, 2024Updated last year
- Procedural city generation.☆13Oct 15, 2022Updated 3 years ago
- GEMV implementation with CUTLASS☆19Aug 21, 2025Updated 6 months ago
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- Implementation of Nonparametric Hamiltonian Monte Carlo☆13Feb 13, 2023Updated 3 years ago
- ☆13Jul 25, 2023Updated 2 years ago
- ☆12Jul 6, 2018Updated 7 years ago
- Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & …☆54Jan 30, 2026Updated last month