Fast and memory-efficient exact attention ported to rocm
☆13Dec 1, 2023Updated 2 years ago
Alternatives and similar repositories for flash-attention-rocm
Users that are interested in flash-attention-rocm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The small, fast game engine for Compose Multiplatform☆10Feb 1, 2025Updated last year
- Fast and memory-efficient exact attention☆226Updated this week
- Document how to make ROCm acceleration on windows 11 for LM Studio and Comfy UI☆36Dec 9, 2025Updated 3 months ago
- a simple Flash Attention v2 implementation with ROCM (RDNA3 GPU, roc wmma), mainly used for stable diffusion(ComfyUI) in Windows ZLUDA en…☆51Aug 25, 2024Updated last year
- Fully automated end to end framework to extract data from complex charts and other figures in scientific literature.☆20Oct 30, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆38Dec 18, 2025Updated 3 months ago
- ☆39Oct 10, 2025Updated 5 months ago
- A collection of scripts to interact with the ESP32 Bus Pirate, log data, dump to file and automate hardware tasks.☆77Feb 6, 2026Updated last month
- Official implementation of "LoFA: Learning to Predict Personalized Prior for Fast Adaptation of Visual Generative Models".☆36Feb 1, 2026Updated last month
- The largest KG for material science☆29Nov 14, 2024Updated last year
- Scholarly Big Data Subject Category Classifier☆10Jul 15, 2019Updated 6 years ago
- A Python module for mapping multiple high-dimensional datasets into a common low-dimensional space.☆10Mar 29, 2018Updated 7 years ago
- Training-free Stylized Text-to-Image Generation with Fast Inference☆27May 30, 2025Updated 9 months ago
- Topic Evolution Analysis - an algorithm for analyzing knowledge flow in text based corpora☆14Oct 16, 2016Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Video models as pure visual reasoners for high-quality text-to-image generation via Chain-of-Frame reasoning.☆37Jan 16, 2026Updated 2 months ago
- SkyReels-V2 with batch mode, video input (extend existing videos), and multiple prompts.☆17May 5, 2025Updated 10 months ago
- Unsupervised learning coupled with applied factor analysis to the five-factor model (FFM), a taxonomy for personality traits used to desc…☆16Jun 19, 2021Updated 4 years ago
- Async, high-performance Kotlin library for interacting with EVM-based blockchains. Targeting JVM and Android platforms.☆58Updated this week
- Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.☆18Jan 15, 2025Updated last year
- Web-Scarping tool for downloading the content of the following publishers: Elsevier, RSC, Web of Science, Springer Nature , Wiley.☆37Jun 24, 2025Updated 9 months ago
- Example on how to use pytorch/yolov8 object detection on computers with AMD integrated GPUs☆24Feb 5, 2024Updated 2 years ago
- SciQAG is a novel framework for automatically generating high-quality science question-answer pairs from a large corpus of scientific lit…☆33Mar 24, 2025Updated last year
- Code of the paper Graph Convolutions over Constituent Trees for Syntax-Aware Semantic Role Labeling☆15Nov 15, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Directed masked autoencoders☆14Mar 17, 2026Updated last week
- Handbook of Network Analysis – part of the KONECT project by Jérôme Kunegis☆16Apr 7, 2023Updated 2 years ago
- A parallelized implementation of Principal Component Analysis (PCA) using Singular Value Decomposition (SVD) in OpenMP for C. The procedu…☆17May 9, 2019Updated 6 years ago
- 获取爱斯维尔期刊投稿,进入under review之后的审稿进度☆37Feb 24, 2025Updated last year
- Render pyecharts as image via pyppeteer☆20Nov 13, 2019Updated 6 years ago
- A next-generation AI-powered infinite canvas workspace built for creators and developers. Experience the future of Generative AI with a d…☆65Jan 24, 2026Updated 2 months ago
- Waffer-thin FlaskGPT on Vercel.☆12Jun 1, 2023Updated 2 years ago
- Automatically hold idle GPU.☆78Nov 9, 2025Updated 4 months ago
- All in one PDF Parser Toolkit☆17Sep 15, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Error detection in Knowledge Graphs: Path Ranking, Embeddings or both?☆12Jan 26, 2020Updated 6 years ago
- ☆16Dec 7, 2021Updated 4 years ago
- A simple ring (circular) buffer implementation for the .NET framework, written in C#☆17Aug 28, 2019Updated 6 years ago
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>☆51Nov 12, 2025Updated 4 months ago
- Complexity Based Prompting for Multi-Step Reasoning☆17Mar 10, 2023Updated 3 years ago
- Document Classification on COVID-19 Literature using the LitCovid collection and the Hedwig library.☆16Oct 26, 2024Updated last year
- An example of graph embeddings for wikipedia page recommendations☆11Aug 26, 2021Updated 4 years ago