tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)
☆29Dec 12, 2023Updated 2 years ago
Alternatives and similar repositories for pytorch-cppcuda-tutorial
Users that are interested in pytorch-cppcuda-tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Rust bindings for Kubernetes Container Storage Interface generated from Protobuf using Tonic/Prost☆14Aug 4, 2021Updated 4 years ago
- Theoretical modelling of doping effects and magnetic field effects on the quantum transport in Graphene.☆14Mar 29, 2013Updated 13 years ago
- Implementation of our paper published in Springer's Signal, Image and Video Processing☆12Dec 5, 2020Updated 5 years ago
- [AAAI2024] Summarizing Stream Data for Memory-Restricted Online Continual Learning☆21Apr 30, 2024Updated 2 years ago
- ☆10Sep 7, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official Implementation of RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Ren…☆138Nov 24, 2024Updated last year
- ☆69Mar 19, 2023Updated 3 years ago
- ☆10Jun 9, 2017Updated 8 years ago
- Fast Neural Network Super-resolution tool based on TensorRT☆15Sep 6, 2025Updated 7 months ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- Code for ICCV 2023 work "Generalized Few-Shot Point Cloud Segmentation Via Geometric Words"☆14Sep 26, 2023Updated 2 years ago
- Perceptron-based branch predictor written in C++☆13Dec 14, 2016Updated 9 years ago
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆30Mar 22, 2026Updated last month
- Official repository for our paper on "Attribution-aware Weight Transfer: A Warm-Start Initialization for Class-Incremental Semantic Segme…☆12Jan 3, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025)☆20Jan 11, 2026Updated 3 months ago
- 大一的软件课程设计I的Qt项目,一部分参考了黑马程序员的教程(十分感谢)。实现了具备背景、bgm的可视化界面六子棋,可以也改成五子棋,实现了人人对战、人机对战、机机对战(观看)功能,可以更改是否开启禁手。☆10Apr 21, 2022Updated 4 years ago
- An attempt to migrate Karpathy's llm.c to safe rust.☆13Jun 4, 2024Updated last year
- Efficient and stable Determinant Quantum Monte Carlo simulations in Python☆11Feb 23, 2026Updated 2 months ago
- ☆27May 27, 2024Updated last year
- ☆12Nov 23, 2020Updated 5 years ago
- Write events for TensorBoard☆12Apr 23, 2026Updated last week
- ☆49Apr 15, 2024Updated 2 years ago
- LLM101n: Let's build a Storyteller 中文版☆137Aug 15, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- TensorRT-in-Action 是一个 GitHub 代码库,提供了使用 TensorRT 的代码示例,并有对应 Jupyter Notebook。☆15Jun 1, 2023Updated 2 years ago
- 《自己动手写AI编译器》☆38Oct 19, 2024Updated last year
- Tutorial for (PyTorch) + (C++) + (Metal shader)☆16Oct 25, 2025Updated 6 months ago
- 使用 CUDA C++ 实现的 llama 模型推理框架☆65Nov 8, 2024Updated last year
- yolov8s-pose using ncnn inferring!☆44Apr 27, 2023Updated 3 years ago
- ☆14Jan 14, 2020Updated 6 years ago
- Apply Graph Neural Networks to Optimize Factor Feature Extraction of FactorVAE☆13Jan 11, 2025Updated last year
- ☆13Apr 9, 2024Updated 2 years ago
- This project is about convolution operator optimization on GPU, include GEMM based (Implicit GEMM) convolution.☆43Sep 29, 2025Updated 7 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Code for NeurIPS 2021 paper "Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning".☆16Oct 18, 2021Updated 4 years ago
- Examples illustrating usage of the rocBLAS library☆17Aug 12, 2024Updated last year
- Jupyter notebook with a didactic implementation of DMRG.☆19Mar 10, 2023Updated 3 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆11Dec 13, 2023Updated 2 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 2 months ago
- ☆21Apr 19, 2023Updated 3 years ago
- Learning Inter-Superpoint Affinity for Weakly Supervised 3D Instance Segmentation☆25Dec 6, 2022Updated 3 years ago