yangluo7 / CAMELinks
[ACL 2023] The official implementation of "CAME: Confidence-guided Adaptive Memory Optimization"
☆92Updated 3 months ago
Alternatives and similar repositories for CAME
Users that are interested in CAME are comparing it to the libraries listed below
Sorting:
- ☆49Updated last year
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆105Updated 11 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆78Updated last year
- An open-source implementation of Regional Adaptive Sampling (RAS), a novel diffusion model sampling strategy that introduces regional var…☆130Updated 4 months ago
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training☆178Updated 4 months ago
- [ICML2025] LoRA fine-tune directly on the quantized models.☆30Updated 7 months ago
- Minimal Differentiable Image Reward Functions☆60Updated 2 months ago
- Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"☆123Updated last year
- TerDiT: Ternary Diffusion Models with Transformers☆71Updated last year
- The official implementation of Latte: Latent Diffusion Transformer for Video Generation.☆33Updated 4 months ago
- [ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di…☆60Updated last year
- ☆167Updated 5 months ago
- FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.☆46Updated 11 months ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆109Updated last month
- [CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers☆51Updated 9 months ago
- Patch convolution to avoid large GPU memory usage of Conv2D☆88Updated 5 months ago
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility☆53Updated 5 months ago
- Merge safetensor files using the technique described in "Language Models are Super Mario: Absorbing Abilities from Homologous Models as a…☆79Updated 8 months ago
- PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight☆142Updated last year
- Low-bit optimizers for PyTorch☆129Updated last year
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu☆70Updated 6 months ago
- Triton implement of bi-directional (non-causal) linear attention☆50Updated 4 months ago
- Code for NeurIPS 2023 paper "Restart Sampling for Improving Generative Processes"☆149Updated last year
- A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders☆24Updated 4 months ago
- [ICLR 2025] Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆103Updated last year
- (WIP) Parallel inference for black-forest-labs' FLUX model.☆19Updated 7 months ago
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization☆42Updated 6 months ago
- This is the official repo for the paper "Accelerating Parallel Sampling of Diffusion Models" Tang et al. ICML 2024 https://openreview.net…☆14Updated 11 months ago
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆108Updated last month
- Code for "Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes"☆28Updated last year