ollewelin / Installing-and-Test-PyTorch-C-API-on-Ubuntu-with-GPU-enabled
Installing and Test PyTorch C++ API on Ubuntu with GPU enabled
☆25 · Updated last year
Alternatives and similar repositories for Installing-and-Test-PyTorch-C-API-on-Ubuntu-with-GPU-enabled:
Users who are interested in Installing-and-Test-PyTorch-C-API-on-Ubuntu-with-GPU-enabled are comparing it to the repositories listed below:
- Some CUDA design patterns and a bit of template magic for CUDA ☆150 · Updated last year
- An expression template based linear algebra library running completely on the GPU using CUDA ☆25 · Updated 3 years ago
- Tutorial for wrapping C++ library into Python using pybind11 and CMake ☆143 · Updated last year
- Learn OpenMP examples step by step ☆92 · Updated 3 months ago
- Quick and Self-Contained TensorRT Custom Plugin Implementation and Integration ☆57 · Updated 10 months ago
- TensorRT Examples (TensorRT, Jetson Nano, Python, C++) ☆94 · Updated last year
- NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications. ☆194 · Updated 10 months ago
- This repository contains various examples of using Eigen library ☆14 · Updated 3 months ago
- ByteTrack-Eigen is a C++ implementation of the ByteTrack object tracking method, leveraging the Eigen library for high-performance matrix… ☆41 · Updated last year
- The real-time Instance Segmentation Algorithm Yolov7 running on TensorRT and ONNX ☆22 · Updated 2 years ago
- Python scripts for performing road segmentation and car detection using the HybridNets multitask model in ONNX. ☆71 · Updated 3 years ago
- YOLOv5 on Orin DLA ☆198 · Updated last year
- ONNX Runtime Inference C++ Example ☆235 · Updated 3 weeks ago
- Programming accelerated applications with CUDA C/C++, enough to be able to begin work accelerating your own CPU-only applications for per… ☆94 · Updated 6 years ago
- C++20 N-dimensional Matrix class for hobby project ☆23 · Updated 3 years ago
- Sample showing how to use YOLOv5 with Nvidia Isaac ROS DNN Inference ☆43 · Updated 2 years ago
- Web-based tool to convert model into MyriadX blob ☆16 · Updated this week
- LightNet-TRT is a high-efficiency and real-time implementation of convolutional neural networks (CNNs) using Edge AI. ☆74 · Updated last year
- Serial and parallel implementations of matrix multiplication ☆40 · Updated 4 years ago
- Examples for using SYCL on CUDA ☆62 · Updated last month
- How to use CUDA with Python numpy ☆38 · Updated 7 years ago
- ByteTrack for DeepStream 6.4 ☆18 · Updated last year
- Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda ☆16 · Updated 2 weeks ago
- Source code examples from the Parallel Forall Blog ☆96 · Updated 6 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming" ☆88 · Updated last year
- Example scripts for the detection of lanes using the ultra fast lane detection model in ONNX. ☆54 · Updated last year
- A detailed conversion of a C++ project to Python using pybind11 ☆18 · Updated 3 years ago
- Easy to use neural networks for NVIDIA Jetson (and desktop too!) ☆74 · Updated 2 years ago
- Sample projects for TensorRT in C++ ☆194 · Updated 2 years ago
- TensorFlow Lite segmentation on Raspberry Pi 4 aka Unet at 7.2 FPS with 64-bit OS ☆19 · Updated 2 years ago