CPU Memory Compiler and Parallel programming
☆26Nov 18, 2024Updated last year
Alternatives and similar repositories for riven
Users that are interested in riven are comparing it to the libraries listed below
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆15Aug 31, 2023Updated 2 years ago
- flash attention tutorial written in python, triton, cuda, cutlass☆490Jan 20, 2026Updated last month
- Fractional interpolation using a Farrow structure☆10Oct 11, 2023Updated 2 years ago
- ☆40Updated this week
- This project is about convolution operator optimization on GPU, including GEMM-based (implicit GEMM) convolution.☆43Sep 29, 2025Updated 5 months ago
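- A good project for campus recruiting (autumn and spring hiring) and internships: build, hands-on and from scratch, a large-model inference framework that supports LLama2/3 and Qwen2.5.☆507Oct 28, 2025Updated 4 months ago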
- ☆12Feb 7, 2018Updated 8 years ago
- CUTLASS and CuTe Examples☆133Nov 30, 2025Updated 3 months ago
- Collision-detection and collision-avoidance navigation demonstration using a feedforward neural network.☆13Nov 4, 2018Updated 7 years ago
- ☆13May 25, 2023Updated 2 years ago
- A CircuitPython RPN Calculator☆12Jul 22, 2025Updated 7 months ago
- ☆10Apr 9, 2017Updated 8 years ago
- ☆12Mar 13, 2023Updated 2 years ago
- Mini CCL - A lightweight collective communication library☆25Jan 2, 2026Updated 2 months ago
- DanesfieldApp is a web-based application for Danesfield applications running at the back end. Used for 3D reconstruction from satellite i…☆11Oct 28, 2023Updated 2 years ago
- A C++ hash map/table that uses SIMD (specifically Intel x86 SSE/AVX)☆11Apr 30, 2019Updated 6 years ago
- Improved the performance of 8-bit PTQ4DM, especially on FID.☆11Aug 30, 2023Updated 2 years ago
- This iOS app demonstrates how to read PCM samples from large wave files into a circular buffer, so that they can be processed and playe…☆18Feb 8, 2013Updated 13 years ago
- Rust study notes☆11Jun 7, 2023Updated 2 years ago
- Code for the paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers" with GPT-J implementation.☆15Mar 22, 2023Updated 2 years ago
- Super Resolution Gaming Dataset☆11Jan 5, 2025Updated last year
- HWFI: Hybrid Warping Fusion for Video Frame Interpolation. IJCV 2022☆11Sep 7, 2022Updated 3 years ago
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆19Nov 28, 2025Updated 3 months ago
- Guide to deploying deep-learning inference networks and deep vision primitives on SOPHON TPU.☆19Nov 14, 2025Updated 3 months ago
- ☆13Sep 19, 2025Updated 5 months ago
- [ICME 2024] DIIF (Dynamic Implicit Image Function for Efficient Arbitrary-Scale Super-Resolution).☆13Mar 13, 2024Updated last year
- ☆14Oct 9, 2022Updated 3 years ago
- Example code for the book CUDA_C编程权威指南 (an authoritative guide to CUDA C programming)☆13Mar 22, 2023Updated 2 years ago
- This project aims to convert video files into separate encrypted pieces and play them only via the built-in player, just like offline download videos…☆13Jan 28, 2019Updated 7 years ago
- ☆24May 9, 2025Updated 10 months ago
- ☆12Aug 31, 2023Updated 2 years ago
- TEF6686☆13Feb 11, 2020Updated 6 years ago
- Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023…☆13Dec 5, 2023Updated 2 years ago
- ☆10May 17, 2024Updated last year
- Master☆16Jul 30, 2025Updated 7 months ago
- OCR post processing and spelling correction.☆11Nov 12, 2018Updated 7 years ago
- Quantize yolov7 using pytorch_quantization.🚀🚀🚀☆12Oct 20, 2023Updated 2 years ago
- https://github.com/bernakabadayi/ganavatar☆12Oct 8, 2024Updated last year
- Flash Attention in ~100 lines of CUDA (forward pass only)☆11Jun 10, 2024Updated last year