Stability-AI / flash-attentionLinks
Fast and memory-efficient exact attention
☆11Updated 2 years ago
Alternatives and similar repositories for flash-attention
Users that are interested in flash-attention are comparing it to the libraries listed below
Sorting:
- σ-GPT: A New Approach to Autoregressive Models☆70Updated last year
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, …☆87Updated 2 years ago
- Consistency models trained on CIFAR-10, in JAX.☆150Updated 2 years ago
- Cerule - A Tiny Mighty Vision Model☆68Updated 2 months ago
- Implementation of the premier Text to Video model from OpenAI☆56Updated last year
- Can RL solve simple problems?☆54Updated 2 years ago
- ☆62Updated 2 years ago
- Load & manage evolving datasets efficiently☆23Updated 5 months ago
- ☆52Updated 2 years ago
- Fast inference of Instruct tuned LLaMa on your personal devices.☆23Updated 2 years ago
- Exploration into the Firefly algorithm in Pytorch☆41Updated 11 months ago
- Explorations into the recently proposed Taylor Series Linear Attention☆100Updated last year
- Implementation of a framework for Genie2 in Pytorch☆156Updated last year
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆57Updated last year
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆25Updated 3 months ago
- LLM training in simple, raw C/CUDA☆18Updated last year
- Code base for internal reward models and PPO training☆24Updated 2 years ago
- A minimal version of GPT-2 using PyTorch and Gemma 3 using JAX.☆47Updated last month
- Implementation of the Llama architecture with RLHF + Q-learning☆170Updated last year
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆122Updated last year
- inference code for mixtral-8x7b-32kseqlen☆105Updated 2 years ago
- ☆63Updated last year
- Synthetic data generator for image, video and 3D models☆32Updated last year
- Minimal Implimentation of VCRec (2024) for collapse provention.☆17Updated last year
- This repository will contain code for the paper "CLIP meets GamePhysics: Towards bug identification in gameplay videos using zero-shot tr…☆26Updated 2 years ago
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!☆40Updated 2 years ago
- ☆138Updated last year
- Simple large-scale training of stable diffusion with multi-node support.☆133Updated 2 years ago
- Convert natural language to LaTeX within Overleaf using LLMs☆120Updated 3 years ago
- 🏥 Health monitor for a Petals swarm☆40Updated last year