A basic pure pytorch implementation of flash attention
☆16Oct 28, 2024Updated last year
Alternatives and similar repositories for flash_attention
Users that are interested in flash_attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- Here you find the Spectral Operator Learning Under construcTION☆20May 8, 2024Updated 2 years ago
- new optimizer☆20Aug 4, 2024Updated last year
- ☆17Oct 22, 2024Updated last year
- Experimental GPU language with meta-programming☆31Sep 6, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual Generation via Weighted h-Transform Sampling"☆42May 8, 2026Updated 2 weeks ago
- ☆20May 30, 2024Updated last year
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations ".☆26Feb 14, 2026Updated 3 months ago
- Source code for paper "Learning the Solution Operator of Boundary Value Problems using Graph Neural Networks"☆20Jul 25, 2024Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- ☆27Jan 26, 2026Updated 3 months ago
- ☆34May 14, 2025Updated last year
- Official implementation for RoMaP :Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampl…☆21Aug 5, 2025Updated 9 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆43Jan 15, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13Dec 19, 2023Updated 2 years ago
- FlexiTokens☆23Dec 27, 2025Updated 4 months ago
- [ICML 2024] Official PyTorch implementation of the Vectorized Conditional Neural Field.☆18Aug 1, 2024Updated last year
- Extensive time series analysis of chinese PM2.5 content, using models from ARMA and VAR to LSTMs and dynamic time warping clustering☆12Aug 17, 2019Updated 6 years ago
- An API designed for code completion and fine-tuning of open-source large language models on internal codebases and documents.☆14Oct 17, 2024Updated last year
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆26Jul 21, 2025Updated 10 months ago
- Tempo: Small Vision-Language Models are Smart Compressors for Long Video Understanding☆69Apr 29, 2026Updated 3 weeks ago
- Using FlexAttention to compute attention with different masking patterns☆47Sep 22, 2024Updated last year
- Bilinear interpolation on grids with jax☆16May 13, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Jul 15, 2024Updated last year
- Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆15May 10, 2024Updated 2 years ago
- A tutorial on doing RL research in Julia using both Jupyter notebooks and normal project structures.☆10Jun 23, 2021Updated 4 years ago
- Continual Learning Toolkit for Reinforcement Learning☆21Jan 28, 2018Updated 8 years ago
- CUDA implementation of Wavelet KAN.☆17Jun 8, 2024Updated last year
- ☆40May 8, 2026Updated 2 weeks ago
- Experiment utility code, specifically designed for use with Compute Canada.☆11Jan 27, 2025Updated last year
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆613Oct 7, 2025Updated 7 months ago
- [NeurIPS 2024 Spotlight] Towards Universal Mesh Movement Networks☆19Jul 16, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆52Jan 28, 2024Updated 2 years ago
- Rheological Universal Differential Equations: scientific machine learning for modeling complex fluids☆21Dec 23, 2022Updated 3 years ago
- Analyzing LLM Alignment via Token distribution shift☆18Jan 26, 2024Updated 2 years ago
- Avisynth DeFlicker plugin can remove old film intensity flicker by temporal mean luma smoothing.☆15May 23, 2019Updated 6 years ago
- sync google contacts with information from the dominos data breach <3☆11May 24, 2021Updated 4 years ago
- ☆23May 22, 2024Updated 2 years ago
- Spectral Neural Operator☆84Dec 20, 2023Updated 2 years ago