Fast and memory-efficient exact attention ported to rocm
☆13Dec 1, 2023Updated 2 years ago
Alternatives and similar repositories for flash-attention-rocm
Users that are interested in flash-attention-rocm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The small, fast game engine for Compose Multiplatform☆10Feb 1, 2025Updated last year
- Fast and memory-efficient exact attention☆230Apr 27, 2026Updated last week
- Document how to make ROCm acceleration on windows 11 for LM Studio and Comfy UI☆37Dec 9, 2025Updated 4 months ago
- a simple Flash Attention v2 implementation with ROCM (RDNA3 GPU, roc wmma), mainly used for stable diffusion(ComfyUI) in Windows ZLUDA en…☆52Aug 25, 2024Updated last year
- Fully automated end to end framework to extract data from complex charts and other figures in scientific literature.☆20Oct 30, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆39Dec 18, 2025Updated 4 months ago
- ☆40Oct 10, 2025Updated 6 months ago
- Official implementation of "LoFA: Learning to Predict Personalized Prior for Fast Adaptation of Visual Generative Models".☆38Feb 1, 2026Updated 3 months ago
- The largest KG for material science☆30Nov 14, 2024Updated last year
- Scholarly Big Data Subject Category Classifier☆10Jul 15, 2019Updated 6 years ago
- A collection of scripts to interact with the ESP32 Bus Pirate, log data, dump to file and automate hardware tasks.☆80Feb 6, 2026Updated 3 months ago
- A Python module for mapping multiple high-dimensional datasets into a common low-dimensional space.☆10Mar 29, 2018Updated 8 years ago
- A curated list of outstanding Free, Libre, and Open Source Software (FLOSS) Computer Algebra Systems (CAS) for mathematicians, educators,…☆41May 3, 2024Updated 2 years ago
- Training-free Stylized Text-to-Image Generation with Fast Inference☆27May 30, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Topic Evolution Analysis - an algorithm for analyzing knowledge flow in text based corpora☆14Oct 16, 2016Updated 9 years ago
- Video models as pure visual reasoners for high-quality text-to-image generation via Chain-of-Frame reasoning.☆37Jan 16, 2026Updated 3 months ago
- SkyReels-V2 with batch mode, video input (extend existing videos), and multiple prompts.☆17May 5, 2025Updated last year
- Unsupervised learning coupled with applied factor analysis to the five-factor model (FFM), a taxonomy for personality traits used to desc…☆16Jun 19, 2021Updated 4 years ago
- ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling☆130Mar 31, 2026Updated last month
- Async, high-performance Kotlin library for interacting with EVM-based blockchains. Targeting JVM and Android platforms.☆60Updated this week
- Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.☆18Jan 15, 2025Updated last year
- Web-Scarping tool for downloading the content of the following publishers: Elsevier, RSC, Web of Science, Springer Nature , Wiley.☆40Jun 24, 2025Updated 10 months ago
- Example on how to use pytorch/yolov8 object detection on computers with AMD integrated GPUs☆24Feb 5, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- SciQAG is a novel framework for automatically generating high-quality science question-answer pairs from a large corpus of scientific lit…☆34Mar 24, 2025Updated last year
- Code of the paper Graph Convolutions over Constituent Trees for Syntax-Aware Semantic Role Labeling☆15Nov 15, 2020Updated 5 years ago
- Directed masked autoencoders☆14Mar 25, 2026Updated last month
- Handbook of Network Analysis – part of the KONECT project by Jérôme Kunegis☆16Apr 7, 2023Updated 3 years ago
- A parallelized implementation of Principal Component Analysis (PCA) using Singular Value Decomposition (SVD) in OpenMP for C. The procedu…☆17May 9, 2019Updated 6 years ago
- 获取爱斯维尔期刊投稿,进入under review之后的审稿进度☆38Feb 24, 2025Updated last year
- Render pyecharts as image via pyppeteer☆20Nov 13, 2019Updated 6 years ago
- Waffer-thin FlaskGPT on Vercel.☆12Jun 1, 2023Updated 2 years ago
- All in one PDF Parser Toolkit☆17Sep 15, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Error detection in Knowledge Graphs: Path Ranking, Embeddings or both?☆12Jan 26, 2020Updated 6 years ago
- ☆16Dec 7, 2021Updated 4 years ago
- Automatically hold idle GPU.☆78Nov 9, 2025Updated 5 months ago
- A simple ring (circular) buffer implementation for the .NET framework, written in C#☆17Aug 28, 2019Updated 6 years ago
- Complexity Based Prompting for Multi-Step Reasoning☆17Mar 10, 2023Updated 3 years ago
- Document Classification on COVID-19 Literature using the LitCovid collection and the Hedwig library.☆16Oct 26, 2024Updated last year
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>☆54Nov 12, 2025Updated 5 months ago