CPU Memory Compiler and Parallel programming
☆26Nov 18, 2024Updated last year
Alternatives and similar repositories for riven
Users who are interested in riven are comparing it to the libraries listed below.
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆16Aug 31, 2023Updated 2 years ago
- flash attention tutorial written in python, triton, cuda, cutlass☆508Jan 20, 2026Updated 3 months ago
- KsanaDiT: High-Performance DiT (Diffusion Transformer) Inference Framework for Video & Image Generation☆50Mar 30, 2026Updated last month
- a simple WIP runtime reflection library☆13May 11, 2022Updated 3 years ago
- ☆23Aug 14, 2024Updated last year
- Documentation for RELION☆15Mar 13, 2026Updated last month
- Example setup for Relion on AWS ParallelCluster for CryoEM☆13May 21, 2022Updated 3 years ago
- ☆120Apr 11, 2024Updated 2 years ago
- Quantize yolov7 using pytorch_quantization.🚀🚀🚀☆12Oct 20, 2023Updated 2 years ago
- Automated workflow for preparing tilt series data for RELION 4.0.☆13Dec 17, 2023Updated 2 years ago
- ☆12Feb 7, 2018Updated 8 years ago
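- Based on QOpenGLWidget; implements point cloud loading, display, and mouse/keyboard interaction, with point cloud rotation, translation, zooming, and similar functions.☆11May 7, 2020Updated 6 years ago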
- Kunpeng Tech Blog: https://kunpengcompute.github.io/☆19Jul 8, 2021Updated 4 years ago
- A good project for campus recruiting (fall and spring) and internships: build from scratch an LLM inference framework that supports LLama2/3 and Qwen2.5.☆532Oct 28, 2025Updated 6 months ago
- Some C++/C/CUDA Extension☆16Feb 2, 2022Updated 4 years ago
- A crypto-assisted framework for protecting the privacy of models and queries in inference.☆19Oct 28, 2021Updated 4 years ago
- Google's MediaPipe (v0.8.9) and Python Wheel installer for Jetson Nano (JetPack 4.6) compiled for CUDA 10.2☆16Jun 7, 2023Updated 2 years ago
- ☆49Mar 4, 2026Updated 2 months ago
- Estimate depth from surface normal.☆12Aug 14, 2020Updated 5 years ago
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆28Nov 11, 2025Updated 5 months ago
- An object tracking project with YOLOv5-v5.0 and Deepsort, sped up with C++ and TensorRT.☆16Oct 23, 2025Updated 6 months ago
- CUDA 8-bit Tensor Core Matrix Multiplication based on m16n16k16 WMMA API☆37Sep 15, 2023Updated 2 years ago
- Using TensorRT to accelerate Segformer.☆11Oct 6, 2023Updated 2 years ago
- Lab documentation for the Digital Logic Design course at Zhejiang University, fall-winter semester of the 2023 academic year.☆12Jan 11, 2024Updated 2 years ago
- Calibration of depth sensors, e.g. Kinect, Asus Xtion☆13Apr 26, 2019Updated 7 years ago
- An introduction to learning CUDA programming☆38Jul 22, 2024Updated last year
- ☆23Aug 20, 2025Updated 8 months ago
- A CPU and GPU accelerated framework for TFHE. The framework includes algebraic, vector, and matrix operations.☆21Apr 15, 2020Updated 6 years ago
- ☆10Jan 3, 2024Updated 2 years ago
- CUTLASS and CuTe Examples☆135Nov 30, 2025Updated 5 months ago
- ☆12Aug 31, 2023Updated 2 years ago
- A tool to convert a TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- SGLang Kernel Wheel Index☆22Updated this week
- HWFI: Hybrid Warping Fusion for Video Frame Interpolation. IJCV 2022☆11Sep 7, 2022Updated 3 years ago
- This project is about convolution operator optimization on GPU, including GEMM-based (implicit GEMM) convolution.☆43Sep 29, 2025Updated 7 months ago
- Performance Engineering of Software Systems (6.172)☆28Feb 27, 2020Updated 6 years ago
- CUDA Templates for Linear Algebra Subroutines☆101Apr 25, 2024Updated 2 years ago
- CUDA SGEMM optimization note☆15Oct 31, 2023Updated 2 years ago
- PointPillars TensorRT version pretrained on MMDetection3d with WaymoOpenDataset☆23Aug 11, 2022Updated 3 years ago