CPU Memory Compiler and Parallel programing
☆26Nov 18, 2024Updated last year
Alternatives and similar repositories for riven
Users that are interested in riven are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆16Aug 31, 2023Updated 2 years ago
- ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network C++实现(非官方)☆17Apr 9, 2019Updated 7 years ago
- flash attention tutorial written in python, triton, cuda, cutlass☆522Jan 20, 2026Updated 4 months ago
- KsanaDiT: High-Performance DiT (Diffusion Transformer) Inference Framework for Video & Image Generation☆55May 13, 2026Updated last month
- a simple WIP runtime reflection library☆13May 11, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆22Aug 14, 2024Updated last year
- rust 学习笔记☆11Jun 7, 2023Updated 3 years ago
- ☆122May 16, 2025Updated last year
- ☆13Oct 8, 2024Updated last year
- YOLO for Uniform Directed Object detection☆13Mar 28, 2024Updated 2 years ago
- Differentiable Vector Graphics Rasterization☆11Jan 25, 2025Updated last year
- ☆121Apr 11, 2024Updated 2 years ago
- A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs☆15Dec 17, 2024Updated last year
- ☆18Jan 4, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- DanesfieldApp is web based application, for Danesfield Applications running at the back-end. Using for 3D Reconstruction from satellite i…☆12Oct 28, 2023Updated 2 years ago
- Flash Attention in ~100 lines of CUDA (forward pass only)☆12Jun 10, 2024Updated 2 years ago
- Quantize yolov7 using pytorch_quantization.🚀🚀🚀☆12Oct 20, 2023Updated 2 years ago
- ☆15Oct 9, 2022Updated 3 years ago
- [ECCV 2024] FlexAttention for Efficient High-Resolution Vision-Language Models☆49Jan 8, 2025Updated last year
- 校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3 和Qwen2.5的大模型推理框架。☆548Oct 28, 2025Updated 7 months ago
- Google's MediaPipe (v0.8.9) and Python Wheel installer for Jetson Nano (JetPack 4.6) compiled for CUDA 10.2☆16Jun 7, 2023Updated 3 years ago
- ☆51Mar 4, 2026Updated 3 months ago
- Estimate depth from surface normal.☆12Aug 14, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API☆37Sep 15, 2023Updated 2 years ago
- 浙江大学 2023 学年秋冬学期《数字逻辑设计》实验文档。☆12Jan 11, 2024Updated 2 years ago
- Using TensorRT accelerate Segformer.☆11Oct 6, 2023Updated 2 years ago
- Calibration of depth sensors, e.g. Kinect, Asus Xtion☆13Apr 26, 2019Updated 7 years ago
- cuda编程学习入门☆38Jul 22, 2024Updated last year
- ☆23Aug 20, 2025Updated 9 months ago
- A CPU and GPU accelerated framework for TFHE. The framework includes algebraic, vector, and matrix operations.☆21Apr 15, 2020Updated 6 years ago
- Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda☆23Jun 10, 2026Updated last week
- CUTLASS and CuTe Examples☆136Nov 30, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Rust bindings for SPDK☆12Mar 5, 2020Updated 6 years ago
- SPBench: A Framework for Benchmarking Stream Processing Applications☆11Dec 16, 2025Updated 6 months ago
- A tool convert TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- ☆13Aug 31, 2023Updated 2 years ago
- [arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual Generation via Weighted h-Transform Sampling"☆42May 8, 2026Updated last month
- SGLang Kernel Wheel Index☆23Updated this week
- HWFI: Hybrid Warping Fusion for Video Frame Interpolation. IJCV 2022☆11Sep 7, 2022Updated 3 years ago