high-performance linear attention kernel library built on TileLang
☆536May 7, 2026Updated last month
Alternatives and similar repositories for FlashQLA
Users that are interested in FlashQLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Jun 9, 2023Updated 3 years ago
- Offical implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS2024 Oral)☆36Jan 18, 2025Updated last year
- [CVPR 2026 Highlight] Official implementation of Log-linear Sparse Attention (LLSA).☆81May 1, 2026Updated last month
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- ☆11Apr 5, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Accelerate multihead attention transformer model using HLS for FPGA☆13Dec 7, 2023Updated 2 years ago
- CS169.1x Software as a Service course offered by UC Berkeley at edx.org☆14Oct 28, 2014Updated 11 years ago
- An official repository for GPTailor☆18Jun 29, 2025Updated 11 months ago
- German Language Understanding Evaluation Benchmark @NAACL24☆22Dec 11, 2025Updated 6 months ago
- ☆15Mar 22, 2024Updated 2 years ago
- ☆70Jul 8, 2025Updated 11 months ago
- ☆28Jan 24, 2024Updated 2 years ago
- A comprehensive e-commerce solution that includes a fully functional website, an admin dashboard with content management capabilities, an…☆11Jul 7, 2023Updated 2 years ago
- A minimal, educational HEVC (H.265) encoder written in Python.☆53Feb 23, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- BOOM's Simulation Accelerator.☆13Dec 16, 2021Updated 4 years ago
- About Official PyTorch(MMCV) implementation of “SUMix: Mixup with Semantic and Uncertain Information” (ECCV 2024)☆12Sep 2, 2024Updated last year
- [ICML 2024] VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling☆10Sep 22, 2024Updated last year
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆39Nov 11, 2025Updated 7 months ago
- A parser for PTX 6.5☆13Jun 19, 2023Updated 2 years ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- Code for the ICLR'24 paper "Self-supervised Representation Learning From Random Data Projectors☆16Mar 16, 2024Updated 2 years ago
- ☆20Aug 14, 2025Updated 10 months ago
- ☆15Aug 10, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A cross-platform RISC-V interpreter that implements the RV32IMA instruction set.☆24Aug 23, 2022Updated 3 years ago
- self hosted responsive photo/album manager & server writen in nodejs, koa2, react, redux☆11May 25, 2017Updated 9 years ago
- MERN Fullstack E-commerce website. fully functional with Stripe and Paystack payment. Built with React, Expressjs, Nodejs and MongoDB☆17Jun 13, 2024Updated 2 years ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- My personal website.☆17May 5, 2019Updated 7 years ago
- This repository provides tutorial, which discusses running sample publisher and subscriber using multiple transports of point_cloud_trans…☆11Mar 17, 2026Updated 3 months ago
- ☆18Mar 18, 2024Updated 2 years ago
- 一个用Apple Metal实现的Llama和通义千问大模型本地推理☆10Apr 26, 2024Updated 2 years ago
- An efficient spiking variational autoencoder☆13Nov 13, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854)☆74May 13, 2026Updated last month
- CenterNet3D 部署版本,便于移植不同平台(onnx、tensorRT、rknn、Horizon)。☆14May 24, 2024Updated 2 years ago
- ☆367Jan 28, 2026Updated 4 months ago
- [ICCV 2025] LIRA☆22Nov 25, 2025Updated 6 months ago
- Implement spike-drive using OR residual connection and propose SynA attention for natural pruning.(Under Review)☆13Mar 31, 2024Updated 2 years ago
- ☆24May 14, 2025Updated last year
- Nsight Python is a Python kernel profiling interface based on NVIDIA Nsight Tools☆204Updated this week