Instead of scrolling on your phone after work in the evening, learn something. Series 1: CUFX (Cuda Framework eXtended), a CUDA computing framework.
☆16 · Dec 15, 2024 · Updated last year
Alternatives and similar repositories for CUFX
Users that are interested in CUFX are comparing it to the libraries listed below.
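- Real-time analysis of user behavior data based on Flink ☆11 · Oct 31, 2019 · Updated 6 years ago
- ☆10 · Aug 20, 2018 · Updated 7 years ago
- ☆30 · Nov 16, 2024 · Updated last year
- Collection of repositories of University of Science and Technology of China (USTC) entries for the Loongson Cup competition ☆17 · Oct 2, 2024 · Updated last year
- Backend service for an invoice recognition system built with FastAPI, with support for concurrency. Uses an ERFNet model trained for invoice contour detection, followed by distortion correction, OCR, and template matching; supports recognition of skewed invoices. Stated accuracy: 99.9%. ☆13 · May 8, 2025 · Updated last year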
- An MLIR-based compiler that takes GPU kernels and compiles them to real hardware instructions. Interactive web visualizer included. ☆130 · Mar 21, 2026 · Updated 2 months ago
- This repository includes most CNN visualization techniques, implemented in PyTorch ☆14 · Apr 14, 2020 · Updated 6 years ago
- Coursework for Spatio-temporal Data Processing and Organization (including the major assignment and practicum) ☆13 · Apr 16, 2023 · Updated 3 years ago
- Source code of the SC '23 paper: "DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multipli… ☆29 · Jun 18, 2024 · Updated last year
- Persistent Bloom Filter ☆12 · Jul 21, 2018 · Updated 7 years ago
- A compiler for a subset of C, with a backend based on LLVM 20 ☆12 · Mar 13, 2025 · Updated last year
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge. ☆19 · Feb 9, 2026 · Updated 3 months ago
- CS6868: Concurrent Programming ☆84 · May 18, 2026 · Updated last week
- FFTE: A Fast Fourier Transform Package (Official tarballs are unpacked into master as commits) ☆12 · Feb 17, 2024 · Updated 2 years ago
- GEMM ☆10 · Aug 26, 2023 · Updated 2 years ago
- An out-of-the-box PyTorch scaffold for neural network quantization-aware training (QAT) research. Website: https://github.com/zhutmost/neuralz… ☆26 · Dec 20, 2022 · Updated 3 years ago
- ☆12 · Jul 4, 2020 · Updated 5 years ago
- Detects impurities in tea leaves based on YOLOv4 and computes the recognition rate using a confusion matrix ☆18 · Aug 25, 2020 · Updated 5 years ago
- ☆11 · May 16, 2026 · Updated last week
- learn-P4-by-examples: P4 examples with Chinese documents. ☆14 · Oct 25, 2019 · Updated 6 years ago
- Naos: Serialization-free RDMA networking in Java ☆17 · Aug 17, 2021 · Updated 4 years ago
- PyTorch implementation of ATSS, with support for multi-GPU distributed training ☆16 · Jan 3, 2021 · Updated 5 years ago
- 2023 OceanBase Database Competition, preliminary round ☆118 · May 8, 2024 · Updated 2 years ago
- ☆17 · Aug 28, 2025 · Updated 8 months ago
- My tests and experiments with some popular DL frameworks. ☆17 · Sep 11, 2025 · Updated 8 months ago
- Documents and lab assignments for Computer Science and Technology courses at the College of Computer Science, Chongqing University ☆21 · Mar 3, 2023 · Updated 3 years ago
- GEMV implementation with CUTLASS ☆21 · Aug 21, 2025 · Updated 9 months ago
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression. ☆13 · Sep 29, 2024 · Updated last year
- ☆14 · Nov 3, 2025 · Updated 6 months ago
- Free resource for the book AI Compiler Development Guide ☆50 · Dec 22, 2022 · Updated 3 years ago
- A modern C++ library for working with JSON data, aims to provide full support for the JSON standard, as well as allowing users to customi… ☆12 · Apr 8, 2026 · Updated last month
- Handy tools & graphics API abstraction for blazing fast prototyping ☆10 · Jan 17, 2024 · Updated 2 years ago
- Deep Learning Demo ☆18 · Oct 14, 2018 · Updated 7 years ago
- This project aims to provide a highly effective KV cache management framework for LLM inference and improve memory utilization and inference sp… ☆56 · Apr 24, 2026 · Updated last month
- A deep learning training framework implemented from scratch in C++ and Python ☆12 · Nov 22, 2020 · Updated 5 years ago
- DoubleAI’s hyperoptimised version of cuGraph ☆52 · Mar 3, 2026 · Updated 2 months ago
- ☆18 · Nov 22, 2025 · Updated 6 months ago
- ☆12 · Jan 25, 2023 · Updated 3 years ago
- This is a PyTorch implementation of the paper ReCoNet. ☆17 · May 6, 2019 · Updated 7 years ago