fal-ai / flashpack
High-throughput tensor loading for PyTorch
☆221 Updated 2 weeks ago
Alternatives and similar repositories for flashpack
Users who are interested in flashpack are comparing it to the libraries listed below.
- Making Flux go brrr on GPUs. ☆161 Updated last month
- Focused on fast experimentation and simplicity ☆80 Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts. ☆52 Updated last year
- faster parallel inference of mochi-1 video generation model ☆125 Updated 11 months ago
- Comparison of different stable diffusion implementations and optimizations ☆41 Updated 2 years ago
- [WIP] Better (FP8) attention for Hopper ☆32 Updated 11 months ago
- ☆48 Updated 11 months ago
- ☆28 Updated 4 months ago
- ☆24 Updated last year
- ☆175 Updated 3 months ago
- Writing FLUX in Triton ☆41 Updated last year
- This repository provides a minimal, single-file implementation of SingLoRA (Single Matrix Low-Rank Adaptation) as described in the paper … ☆44 Updated this week
- RAM is all you need ☆260 Updated 2 months ago
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu ☆78 Updated last year
- ☆30 Updated last year
- ☆79 Updated last year
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models) ☆34 Updated 11 months ago
- Model code for inferencing T5 ☆66 Updated 10 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training). ☆392 Updated last month
- ☆27 Updated last year
- ☆167 Updated this week
- https://hf.co/hexgrad/Kokoro-82M ☆14 Updated 3 weeks ago
- Optimizing diffusion for production-ready speeds ☆34 Updated 3 weeks ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆66 Updated last year
- Let's try and finetune the OpenAI consistency decoder to work for SDXL ☆24 Updated 2 years ago
- JAX port of FLUX.1 models using flax.nnx ☆24 Updated last year
- This repository shows how to use Q8 kernels with `diffusers` to optimize inference of LTX-Video on ADA GPUs. ☆25 Updated last year
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023) ☆17 Updated last year
- ☆39 Updated last year
- ☆69 Updated last year