Implementation of vDNN++; an improvement over vDNN
☆18Dec 7, 2018Updated 7 years ago
Alternatives and similar repositories for vdnn-plus-plus
Users that are interested in vdnn-plus-plus are comparing it to the libraries listed below
Sorting:
- ☆22Nov 7, 2018Updated 7 years ago
- this is the release repository of superneurons☆54Feb 13, 2021Updated 5 years ago
- Implementation of algorithms for memory optimized deep neural network training☆10Jul 23, 2020Updated 5 years ago
- ☆12May 3, 2020Updated 5 years ago
- Thinking is hard - automate it☆18Aug 24, 2022Updated 3 years ago
- ☆50Jun 27, 2019Updated 6 years ago
- Code that accompanies the paper "Predicting the Computational Cost of Deep Learning Models"☆21Dec 14, 2018Updated 7 years ago
- ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.☆27Jul 6, 2023Updated 2 years ago
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21.☆29Mar 16, 2021Updated 4 years ago
- ☆88Updated this week
- A simple script to plot the Roofline model for given HW platforms and applications☆10Aug 22, 2024Updated last year
- ☆11Aug 23, 2023Updated 2 years ago
- ☆40Feb 28, 2020Updated 6 years ago
- Spark, Cassandra, Tessellation and ArcGIS☆10Jan 18, 2015Updated 11 years ago
- A simple cycle-accurate DaDianNao simulator☆13Mar 27, 2019Updated 6 years ago
- Self-supervised adversarial masking for point clouds☆11Jul 12, 2023Updated 2 years ago
- Themis MapReduce and TritonSort☆11Nov 2, 2017Updated 8 years ago
- Baremetal softwares for TrivialMIPS platform☆11Aug 12, 2019Updated 6 years ago
- ☆10Aug 2, 2021Updated 4 years ago
- mcp server for robot and automations☆12Feb 27, 2025Updated last year
- Profile how CUDA applications create and modify data in memory.☆14Mar 22, 2018Updated 7 years ago
- A "gym" style toolkit for building lightweight NAS systems.☆13Jun 13, 2022Updated 3 years ago
- Finetuning LLaMA with DeepSpeed☆10Apr 14, 2023Updated 2 years ago
- Efficient-Tensor-Management-on-HM-for-Deep-Learning☆10Nov 15, 2021Updated 4 years ago
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 2 years ago
- Implement CollAFL using LLVM LTO pass on afl++.☆12Sep 24, 2020Updated 5 years ago
- Fast Approximate Quadratic Assignment for (Brain) Graph Matching☆16Aug 23, 2016Updated 9 years ago
- Implementing an interactive AI avatar using Python, Blender and GPT☆11Dec 5, 2023Updated 2 years ago
- A powerful desktop app turning ppt to video with AI voiceover and subtitles☆26Aug 23, 2025Updated 6 months ago
- PredRNN using TensorFlow Keras☆10Oct 18, 2018Updated 7 years ago
- 爬取酒店评论并作情感分析☆10Nov 10, 2016Updated 9 years ago
- A UserInfoView for android☆11Oct 14, 2020Updated 5 years ago
- A simple template for TensorFlow's highly efficient CudnnLSTM module☆11Jun 8, 2018Updated 7 years ago
- Official code of "NAS acceleration via proxy data", IJCAI21☆10May 29, 2022Updated 3 years ago
- DocuGen = 你的知识库 + AI大模型 = AI自动生成专业文档☆21Jan 26, 2026Updated last month
- If it's Big Data, it's in Big Data Reference.☆10Feb 26, 2015Updated 11 years ago
- Parallel Approximate Nearest Neighbor Search☆14Nov 12, 2022Updated 3 years ago
- Distributed Deep Learning Benchmark Suite☆11Oct 31, 2022Updated 3 years ago
- A simple script to install or remove custom crontab entries☆16Mar 30, 2023Updated 2 years ago