flash attention 优化日志
☆27Jun 4, 2025Updated 9 months ago
Alternatives and similar repositories for flash-attention-opt
Users that are interested in flash-attention-opt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Blogs content and codes☆23Jan 25, 2021Updated 5 years ago
- Official implementation of the ICLR'25 paper "QERA: an Analytical Framework for Quantization Error Reconstruction".☆13Feb 4, 2025Updated last year
- c++模板与泛型,元编程,学习资料☆13Feb 20, 2023Updated 3 years ago
- 使用TensorRT部署SlowFast模型☆24Mar 2, 2022Updated 4 years ago
- blog☆11Sep 23, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Sep 28, 2023Updated 2 years ago
- This project records the process of optimizing SGEMM (single-precision floating point General Matrix Multiplication) on the riscv platfor…☆24Dec 11, 2024Updated last year
- Sound source localization using SRP-PHAT☆25Feb 17, 2019Updated 7 years ago
- Multiple Lidar preprocessor for BEVfusion☆11Aug 25, 2023Updated 2 years ago
- ☆41Oct 11, 2025Updated 5 months ago
- tpu-systolic-array-weight-stationary☆25May 7, 2021Updated 4 years ago
- ☆30Oct 17, 2025Updated 5 months ago
- A single-file educational implementation for understanding vLLM's core concepts and running LLM inference.☆42Mar 4, 2026Updated 3 weeks ago
- Youtue Player for vue 3.x☆26Jul 2, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Implementation of Microscaling data formats in SystemVerilog.☆30Jul 6, 2025Updated 8 months ago
- PyTorch Implementation of Temporal Shift Module for Jester☆13Nov 22, 2022Updated 3 years ago
- ☆40Oct 8, 2024Updated last year
- The Custom Go programming language for scientific/mathematics computing !!!!☆92Feb 21, 2026Updated last month
- 根据毫米波雷达和视频融合数据,基于决策树算法,计算交叉口的相序和配时参数☆18Jun 1, 2021Updated 4 years ago
- Optimize softmax in triton in many cases☆23Sep 6, 2024Updated last year
- This repository contains source code to binarize any real-value word embeddings into binary vectors.☆48Jan 7, 2021Updated 5 years ago
- A Toy Implementation in Python of [FV12]☆23Aug 16, 2023Updated 2 years ago
- A project demonstrating Lidar related AI solutions, including three GPU accelerated Lidar/camera DL networks (PointPillars, CenterPoint, …☆23Aug 23, 2025Updated 7 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Python implementation of the algorithm described in the paper "Efficient generation of simple polygons for characterizing the shape of a …☆20Jun 27, 2022Updated 3 years ago
- ☆32Sep 30, 2025Updated 5 months ago
- yolov8 tensorrt 加速☆52Jan 16, 2023Updated 3 years ago
- ☆12Jan 25, 2023Updated 3 years ago
- ☆11Mar 24, 2023Updated 3 years ago
- Pytorch、Numpy实现NMS、Soft-NMS代码☆12Mar 22, 2021Updated 5 years ago
- Simple Tensorflow implementation of "SRM : A Style-based Recalibration Module for Convolutional Neural Networks"☆18May 30, 2019Updated 6 years ago
- convert cifar-10 dataset from bin to png or jpg☆57Mar 16, 2017Updated 9 years ago
- ☆10Feb 26, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 由视频数据集生成optical flow,frames,warped flow☆20Dec 29, 2017Updated 8 years ago
- This is a repository to practice multi-thread programming in C++☆28Feb 21, 2024Updated 2 years ago
- Step by Step ResUnet Model Architecture using Keras☆11Mar 2, 2022Updated 4 years ago
- an improvement of the paper: Learning to Detect Violent Videos using Convolution LSTM☆11Jun 1, 2020Updated 5 years ago
- ☆14Nov 17, 2021Updated 4 years ago
- ☆10Mar 3, 2021Updated 5 years ago
- Implement some method of LLM KV Cache Sparsity☆41Jun 6, 2024Updated last year