pure c/cpp cnn implementation, with CUDA accelerated.
☆21Apr 30, 2021Updated 4 years ago
Alternatives and similar repositories for SimpleCNN_Release
Users that are interested in SimpleCNN_Release are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- C++ implement a simple CNN framework to train mnist data. Done!☆10Mar 29, 2022Updated 4 years ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- ☆14May 30, 2023Updated 2 years ago
- A Project dedicated to making GPU Partitioning on Windows easier!☆15Jan 10, 2022Updated 4 years ago
- 基于OpenCV的手写数字识别☆11Jan 10, 2017Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆11Aug 18, 2023Updated 2 years ago
- FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swa…☆38Oct 5, 2025Updated 6 months ago
- Predict Flight Trajectory based on Flight Plans and Weather Data☆16Sep 12, 2022Updated 3 years ago
- 基于Qwen2.5模型、使用DISC-Law-SFT-Pair数据集微调的法律大模型☆12Dec 29, 2024Updated last year
- ☆11Oct 9, 2019Updated 6 years ago
- 🏆 The 1st Place Solution for AICity2022 Challenge Track2: Natural Language-Based Vehicle Retrieval.☆12Jul 25, 2022Updated 3 years ago
- ☆20Sep 28, 2024Updated last year
- Build CUDA Neural Network From Scratch☆22Aug 28, 2024Updated last year
- NS3 implementation of Homa Transport Protocol☆23Dec 14, 2025Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Optimized Computer Graphics Matrix Library for use with the SIMD/SSE4 Instructions.☆10Mar 19, 2020Updated 6 years ago
- Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs…☆23Dec 19, 2024Updated last year
- A fast, small, efficient pthreads based threadpool in c☆16Mar 2, 2021Updated 5 years ago
- This repo is "NTHU Parallel Programing" course project.☆10Dec 5, 2017Updated 8 years ago
- a simple API to use CUPTI☆10Aug 19, 2025Updated 7 months ago
- 一步步实现c++中的智能指针☆10Jun 6, 2021Updated 4 years ago
- Triton to TVM transpiler.☆23Oct 14, 2024Updated last year
- 基于C++17实现的简易线程池(附代码解释和知识介绍)☆13Apr 14, 2023Updated 3 years ago
- PyTorch implementation of "Seed, Expand, Constrain: Three Principles for Weakly-Supervised Image Segmentation", ECCV2016☆24Jul 19, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 2020级课程设计DPLL算法解决SAT问题☆12Nov 3, 2021Updated 4 years ago
- Static timing analysis (STA) is a method of validating the timing performance of a design by checking all possible paths for timing viola…☆16Oct 4, 2022Updated 3 years ago
- Real-time facial emotion recognition is a technology that uses computer vision and machine learning to analyze a person's facial expressi…☆15Nov 3, 2023Updated 2 years ago
- ☆31May 1, 2022Updated 3 years ago
- An ATPG tool using PODEM algorithm in C++ that generates a test to detect any given list of Single-Stuck-at Faults☆11Oct 29, 2017Updated 8 years ago
- 用C++实现的一个简单的线程池,支持任务队列,实际任务继承自taskbase。☆12Apr 15, 2015Updated 10 years ago
- NTHU CS6135 VLSI實體設計自動化☆12Mar 12, 2022Updated 4 years ago
- libCircuit is a C++ Library for EDA software development☆18Sep 27, 2018Updated 7 years ago
- The repo is modified from the source code of CannyLine. It provides support for pybind to make it available for python.☆13Apr 25, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Benchmark SGLang on SLURM☆24Apr 8, 2026Updated last week
- High Performance Grouped GEMM in PyTorch☆30May 10, 2022Updated 3 years ago
- 计算机网络微课堂笔记☆15Jul 1, 2023Updated 2 years ago
- NTHU CS5422 Parallel Programming Course Projects (include Odd-Even Sort, Mandelbrot Set, All-Pairs Shortest Path, Blocked All-Pairs Short…☆12Sep 7, 2025Updated 7 months ago
- ☆16Apr 11, 2022Updated 4 years ago
- a vue-demo:vue仿网易新闻m站☆10Jul 26, 2017Updated 8 years ago
- Several common methods of matrix multiplication are implemented on CPU and Nvidia GPU using C++11 and CUDA.☆14Feb 8, 2023Updated 3 years ago