nihil21 / parallel_nn
C++ implementation of a neural network using OpenMP and CUDA for parallelization.
☆9Updated 3 years ago
Alternatives and similar repositories for parallel_nn
Users that are interested in parallel_nn are comparing it to the libraries listed below
Sorting:
- ☆14Updated 2 years ago
- Real Time Object Detection using OpenCV and Deep Learning☆10Updated 3 months ago
- CRISP is a Fast Image Search application that retrieves similar images from a database based on the query image by using Parallel computi…☆7Updated last year
- A Nvidia partnership project of a autonomous car MVP using Jetbot and other Nvidia tools for HPC and Transfer Learning☆29Updated 2 years ago
- Parallel implementation of the Advanced Encryption Standard.☆9Updated 6 years ago
- Parallel graph partitioning☆10Updated 7 years ago
- BITorch: Open-Source Implementation of Binary Neural Networks with PyTorch☆38Updated 11 months ago
- CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution☆17Updated last year
- Parallelizing Strassen’s matrix multiplication using OpenMP, MPI and CUDA.☆15Updated 3 years ago
- pyCUDA implementation of forward propagation for Convolutional Neural Networks☆18Updated 6 years ago
- Large dataset storage format for Pytorch☆45Updated 3 years ago
- Intel® End-to-End AI Optimization Kit☆31Updated 9 months ago
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…☆63Updated 9 months ago
- Generating text sequences using attention-based Bi-LSTM☆11Updated 5 years ago
- ☆19Updated 3 years ago
- ☆21Updated 5 years ago
- EvoGAN: Evolutionary Algorithm based Neural Architecture Search for Generative Adversarial Networks☆9Updated 4 years ago
- A Plug-and-play Lightweight tool for the Inference Optimization of Deep Neural networks☆41Updated last week
- Lightweight C implementation of CNNs for Embedded Systems☆60Updated 2 years ago
- PyTorch implementation of HashedNets☆36Updated 2 years ago
- NNCG: A Neural Network Code Generator☆35Updated 9 months ago
- [ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks"☆32Updated 2 years ago
- 2D Convolution using NumPy☆17Updated 2 years ago
- Deep Compression for PyTorch Model Deployment on Microcontrollers☆19Updated 4 years ago
- We have implemented a framework that supports developers to structured prune neural networks of Tensorflow Models☆28Updated 6 months ago
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14Updated 11 months ago
- t-SNE in python from scratch☆20Updated 7 years ago
- Code for VOTS2023 Challenge tracker☆10Updated last year
- Tiny ImageNet Classification Exercise with PyTorch☆16Updated 3 years ago
- ☆17Updated last year