Stability-AI / flash-attentionLinks
Fast and memory-efficient exact attention
☆11Updated 2 years ago
Alternatives and similar repositories for flash-attention
Users that are interested in flash-attention are comparing it to the libraries listed below
Sorting:
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, …☆87Updated 2 years ago
- Consistency models trained on CIFAR-10, in JAX.☆150Updated 2 years ago
- JAX implementation of the Llama 2 model☆216Updated 2 years ago
- ☆31Updated 2 years ago
- A synthetic story narration dataset to study small audio LMs.☆31Updated 2 years ago
- Hugging Face's Zapier Integration 🤗⚡️☆50Updated 2 years ago
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!☆40Updated 2 years ago
- Blazing fast training of 🤗 Transformers on Graphcore IPUs☆87Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆70Updated last year
- Can RL solve simple problems?☆54Updated 2 years ago
- Convert natural language to LaTeX within Overleaf using LLMs☆120Updated 3 years ago
- Self-Conditioning Pre-Trained Language Models, ICML 2022☆34Updated 3 years ago
- ☆63Updated last year
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆122Updated last year
- ☆138Updated last year
- JAX implementation of the Mistral 7b v0.2 model☆35Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆110Updated 11 months ago
- ☆217Updated 2 months ago
- Load & manage evolving datasets efficiently☆23Updated 5 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- Implementation of the Llama architecture with RLHF + Q-learning☆170Updated last year
- Runs the InstructorXL model to compute embeddings from a Parquet file. Please contribute and improve!☆39Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆26Updated 2 years ago
- Explorations into the recently proposed Taylor Series Linear Attention☆100Updated last year
- ☆32Updated last year
- GoldFinch and other hybrid transformer components☆12Updated last month
- ☆62Updated 2 years ago
- Thispersondoesnotexist went down, so this time, while building it back up, I am going to open source all of it.☆91Updated 2 years ago
- Open-source implementation of AlphaEvolve☆23Updated 8 months ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated 2 years ago