mrzhuzhe/riven

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mrzhuzhe/riven)

mrzhuzhe / riven

CPU Memory Compiler and Parallel programing

☆26

Alternatives and similar repositories for riven

Users that are interested in riven are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ShaYeBuHui01 / flash_attention_inference
View on GitHub
Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.
☆15Aug 31, 2023Updated 2 years ago
CSshengxy / MEC
View on GitHub
ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network C++实现(非官方)
☆17Apr 9, 2019Updated 7 years ago
66RING / tiny-flash-attention
View on GitHub
flash attention tutorial written in python, triton, cuda, cutlass
☆528Jan 20, 2026Updated 6 months ago
leimao / CUTLASS-Examples
View on GitHub
CUTLASS and CuTe Examples
☆137Nov 30, 2025Updated 7 months ago
LucaDavidian / Reflect
View on GitHub
a simple WIP runtime reflection library
☆13May 11, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ranran0523 / SPECNN
View on GitHub
code repo for paper accepted in ICML 2023
☆13Oct 19, 2023Updated 2 years ago
PrincetonUniversity / aspire
View on GitHub
Algorithms for Single Particle Reconstruction
☆15Apr 15, 2024Updated 2 years ago
DaisukeMiyamoto / aws-parallelcluster-relion
View on GitHub
example set up for Relion on AWS ParallelCluster for CryoEM
☆13May 21, 2022Updated 4 years ago
hova88 / CUDA-MatMul-Practice
View on GitHub
☆19Jan 4, 2024Updated 2 years ago
njuhope / cuda_sgemm
View on GitHub
☆121Apr 11, 2024Updated 2 years ago
kunpengcompute / kunpengcompute.github.io
View on GitHub
Kunpeng Tech Blog: https://kunpengcompute.github.io/
☆19Jul 8, 2021Updated 5 years ago
weishengying / cute_gemm
View on GitHub
☆23Aug 14, 2024Updated last year
luliyucoordinate / flash-attention-minimal
View on GitHub
Flash Attention in ~100 lines of CUDA (forward pass only)
☆12Jun 10, 2024Updated 2 years ago
EuanPyle / relion4_tomo_robot
View on GitHub
Automated workflow for preparing tilt series data for RELION 4.0.
☆13Dec 17, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
yhwang-hub / yolov7_QAT
View on GitHub
Quantize yolov7 using pytorch_quantization.🚀🚀🚀
☆12Oct 20, 2023Updated 2 years ago
verl-project / rl-insight
View on GitHub
Provide performance insight capabilities for RL frameworks.
☆48Updated this week
CalebDu / Awesome-Cute
View on GitHub
☆122May 16, 2025Updated last year
AntXinyuan / sph2pob
View on GitHub
(IJCAI 2023) Sph2Pob: Boosting Object Detection on Spherical Images with Planar Oriented Boxes Methods
☆14Aug 23, 2023Updated 2 years ago
zjhellofss / KuiperLLama
View on GitHub
校招、秋招、春招、实习好项目，带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。
☆555Oct 28, 2025Updated 9 months ago
UMass-Embodied-AGI / FlexAttention
View on GitHub
[ECCV 2024] FlexAttention for Efficient High-Resolution Vision-Language Models
☆49Jan 8, 2025Updated last year
AlexReimann / depth_calibration
View on GitHub
Calibration of depth sensors, e.g. Kinect, Asus Xtion
☆13Apr 26, 2019Updated 7 years ago
jundaf2 / CUDA-INT8-GEMM
View on GitHub
CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API
☆37Sep 15, 2023Updated 2 years ago
HuangCongQing / cuda-learning
View on GitHub
cuda编程学习入门
☆38Jul 22, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
emptysoal / Deepsort-YOLOv5-TensorRT
View on GitHub
An object tracking project with YOLOv5-v5.0 and Deepsort, speed up by C++ and TensorRT.
☆16Oct 23, 2025Updated 9 months ago
triple-mu / TensorRT2ONNX
View on GitHub
A tool convert TensorRT engine/plan to a fake onnx
☆41Nov 22, 2022Updated 3 years ago
openebs-archive / spdk-sys
View on GitHub
Rust bindings for SPDK
☆12Mar 5, 2020Updated 6 years ago
Qwesh157 / conv_op_optimization
View on GitHub
This project is about convolution operator optimization on GPU, include GEMM based (Implicit GEMM) convolution.
☆44Sep 29, 2025Updated 10 months ago
mpavageau / LUFS-TruePeak
View on GitHub
LUFS and True Peak metering (app+plug)
☆11Feb 14, 2016Updated 10 years ago
MegEngine / cutlass
View on GitHub
CUDA Templates for Linear Algebra Subroutines
☆102Apr 25, 2024Updated 2 years ago
zhangcheng828 / TensorRT-Plugin
View on GitHub
☆46Apr 7, 2022Updated 4 years ago
YangLinzhuo / cuda-sgemm-optimization
View on GitHub
CUDA SGEMM optimization note
☆15Oct 31, 2023Updated 2 years ago
CedarGroveStudios / CG-35_Calculator
View on GitHub
A CircuitPython RPN Calculator
☆12Jul 22, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
RuningMangoPi / yolov8_QAT
View on GitHub
☆17Oct 16, 2023Updated 2 years ago
Oneflow-Inc / conda-env
View on GitHub
☆12Mar 13, 2023Updated 3 years ago
genshen / advanced-gpu-programming
View on GitHub
☆13Aug 31, 2023Updated 2 years ago
yu-li / HWFI
View on GitHub
HWFI: Hybrid Warping Fusion for Video Frame Interpolation. IJCV 2022
☆11Sep 7, 2022Updated 3 years ago
mpsilfve / ocrpp
View on GitHub
OCR post processing and spelling correction.
☆11Nov 12, 2018Updated 7 years ago
Tartisan / MMDet3d-PointPillars
View on GitHub
PointPillars TensorRT version pretrained on MMDetection3d with WaymoOpenDataset
☆23Aug 11, 2022Updated 3 years ago
syncdoth / Chain-of-Hindsight-PyTorch
View on GitHub
Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.
☆11Apr 5, 2023Updated 3 years ago