The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Unified Device Architecture" (CUDA), hiding much of the complexity of the underlying CUDA functions from the programmer (see the brief overview of the main features). Original author: Markus Grabner
☆27Sep 7, 2011Updated 14 years ago
Alternatives and similar repositories for cudatemplates
Users that are interested in cudatemplates are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tools to convert Caffe models to neon's serialization format☆38Jan 3, 2023Updated 3 years ago
- Distributed optimization framework with parameter server☆23Jun 14, 2015Updated 10 years ago
- C++ implementation of the fast learning algorithm for deep belief nets from Hinton et al. (2006).☆19Jan 4, 2014Updated 12 years ago
- Python wrapper of selective search like algorithm in dlib☆15Aug 31, 2015Updated 10 years ago
- Fast binary matrix product on CPU☆10Feb 11, 2016Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- (Javascript) Animated, multiple progress bar control and tiny chart (sparkline)☆22Nov 6, 2013Updated 12 years ago
- Stream objects to a Mongo database☆47Sep 10, 2017Updated 8 years ago
- Caffe/Neon prototxt training file for our Neurocomputing2017 work: Fuzzy Quantitative Deep Compression Network☆11May 30, 2018Updated 8 years ago
- ☆28May 9, 2016Updated 10 years ago
- ☆25Dec 12, 2017Updated 8 years ago
- Fork of the code for Linux and OSX☆18Sep 30, 2014Updated 11 years ago
- A light-weight deep convolutional neural network for face detection☆13Mar 8, 2019Updated 7 years ago
- A handy tool to generate common files in command line☆24Dec 25, 2021Updated 4 years ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Feb 10, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- semantically labels kinect pointclouds☆16Jul 31, 2012Updated 13 years ago
- 收集/分享我常用的一些 Alfred 2 Workflows☆29May 30, 2014Updated 12 years ago
- Implementation of a Tensorflow XLA rematerialization pass☆15Dec 20, 2019Updated 6 years ago
- ☆48May 16, 2014Updated 12 years ago
- Simple PyTorch profiler that combines DeepSpeed Flops Profiler and TorchInfo☆11Feb 12, 2023Updated 3 years ago
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 10 months ago
- Spatio-temporal pattern contruct and model fusion☆11Jun 10, 2019Updated 6 years ago
- ☆10Dec 10, 2024Updated last year
- Human Evaluation Benchmark for Text Simplification☆10Sep 6, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Julia implementation of LULESH with MPI + X.☆13Jul 23, 2022Updated 3 years ago
- 华容道,一种单人拼图类游戏☆10Feb 12, 2019Updated 7 years ago
- ☆11Jan 22, 2015Updated 11 years ago
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression☆11Oct 7, 2020Updated 5 years ago
- Caffe re-implementation of dynamic network surgery.☆18Jun 15, 2018Updated 7 years ago
- Datasets and notebooks☆13Oct 26, 2016Updated 9 years ago
- This repository contains my experiments with compression-related algorithms☆39Jun 18, 2016Updated 9 years ago
- A simple graph type with data on the vertices and edges.☆13Updated this week
- Anchored Diffusion Language Model (NeurIPS 2025)☆29Oct 13, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- jacobs university fall 2012 3d point cloud processing homework☆13Nov 19, 2012Updated 13 years ago
- Gated Recurrent Unit with Low-rank matrix factorization☆35Mar 11, 2016Updated 10 years ago
- Scripts for monitoring InfiniBand and storage devices☆11Sep 4, 2015Updated 10 years ago
- Dynamic dispatch over arbitrary predicates☆10Feb 2, 2016Updated 10 years ago
- Put color back into monochromatic photos of nature.☆10Apr 21, 2023Updated 3 years ago
- Small and easy C++ AdaBoost Implementation☆70May 17, 2011Updated 15 years ago
- Interface designs for enforcing static computations in array functions with Julia☆15Apr 29, 2026Updated last month