Implementation of vDNN++; an improvement over vDNN
☆18 · Dec 7, 2018 · Updated 7 years ago
Alternatives and similar repositories for vdnn-plus-plus
Users that are interested in vdnn-plus-plus are comparing it to the libraries listed below.
- ☆22 · Nov 7, 2018 · Updated 7 years ago
- this is the release repository of superneurons ☆54 · Feb 13, 2021 · Updated 5 years ago
- ☆12 · May 3, 2020 · Updated 6 years ago
- Thinking is hard - automate it ☆18 · Aug 24, 2022 · Updated 3 years ago
- ☆50 · Jun 27, 2019 · Updated 6 years ago
- A simple cycle-accurate DaDianNao simulator ☆13 · Mar 27, 2019 · Updated 7 years ago
- Code that accompanies the paper "Predicting the Computational Cost of Deep Learning Models" ☆21 · Dec 14, 2018 · Updated 7 years ago
- Implementation of Hyena Hierarchy in JAX ☆10 · Apr 30, 2023 · Updated 3 years ago
- ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system. ☆27 · Jul 6, 2023 · Updated 2 years ago
- General, Hybrid and Optimized Sparse Toolkit (Bitbucket mirror) ☆12 · Apr 8, 2021 · Updated 5 years ago
- Fast SGEMM emulation on Tensor Cores ☆17 · Feb 16, 2025 · Updated last year
- A Light CNN Framework! ☆16 · Apr 8, 2019 · Updated 7 years ago
- A proposal for a standard parallel algorithms library for ISO C++. ☆22 · Feb 28, 2014 · Updated 12 years ago
- High-performance LLM operator library built on TileLang. ☆111 · Apr 29, 2026 · Updated last week
- ☆40 · Apr 27, 2026 · Updated last week
- Implement CollAFL using LLVM LTO pass on afl++. ☆12 · Sep 24, 2020 · Updated 5 years ago
- ☆17 · Nov 3, 2025 · Updated 6 months ago
- This repo contains the code of the paper "RayJoin: Fast and Precise Spatial Join", ICS'24 ☆12 · Apr 29, 2026 · Updated last week
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21. ☆29 · Mar 16, 2021 · Updated 5 years ago
- ☆40 · Feb 28, 2020 · Updated 6 years ago
- tensorflow fork with Salus integration ☆12 · Jan 7, 2022 · Updated 4 years ago
- Distributed Deep Learning Benchmark Suite ☆11 · Oct 31, 2022 · Updated 3 years ago
- ☆13 · May 4, 2017 · Updated 9 years ago
- Workflow management system for the automated and distributed analysis of large-scale experimental data. ☆13 · Oct 3, 2024 · Updated last year
- Example on long write (long characteristic) ☆12 · Sep 3, 2015 · Updated 10 years ago
- Python C++ Code Manager ☆15 · Sep 29, 2024 · Updated last year
- Sparse matrix computation library for GPU ☆59 · Jul 12, 2020 · Updated 5 years ago
- Enabling Edge-Cloud Video Analytics for Robotic Applications (INFOCOM '21) ☆10 · Jan 7, 2021 · Updated 5 years ago
- ☆10 · Aug 2, 2021 · Updated 4 years ago
- A "gym" style toolkit for building lightweight NAS systems. ☆13 · Jun 13, 2022 · Updated 3 years ago
- Open source application container engine; introductory Docker study notes ☆14 · Oct 24, 2017 · Updated 8 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ☆10 · Nov 5, 2020 · Updated 5 years ago
- Finetuning LLaMA with DeepSpeed ☆10 · Apr 14, 2023 · Updated 3 years ago
- Bare-metal software for the TrivialMIPS platform ☆11 · Aug 12, 2019 · Updated 6 years ago
- A router IP written in Verilog. ☆12 · Dec 20, 2019 · Updated 6 years ago
- Efficient-Tensor-Management-on-HM-for-Deep-Learning ☆11 · Nov 15, 2021 · Updated 4 years ago
- ☆10 · Sep 14, 2023 · Updated 2 years ago
- Parallel Approximate Nearest Neighbor Search ☆14 · Nov 12, 2022 · Updated 3 years ago
- ☆54 · Dec 13, 2022 · Updated 3 years ago