AniAggarwal / ecad
Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
☆25 Updated 3 weeks ago
Alternatives and similar repositories for ecad
Users who are interested in ecad are comparing it to the libraries listed below.
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin… ☆30 Updated 2 months ago
- A curated list of papers and resources for text-to-image evaluation. ☆29 Updated last year
- DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection ☆20 Updated last year
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality ☆32 Updated 7 months ago
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network ☆100 Updated 2 weeks ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones". ☆27 Updated last year
- ☆19 Updated 2 years ago
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better ☆31 Updated last month
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models ☆21 Updated 3 months ago
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets" ☆18 Updated 3 months ago
- Democratising RGBA Image Generation With No $$$ (AI4VA@ECCV24) ☆30 Updated 10 months ago
- ☆10 Updated last year
- Video Diffusion State Space Models ☆19 Updated last year
- ☆70 Updated 7 months ago
- the official repo for "D-AR: Diffusion via Autoregressive Models" ☆106 Updated 3 weeks ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method ☆27 Updated last year
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N… ☆50 Updated last year
- ☆39 Updated last year
- Official implementation of LaVin-DiT ☆35 Updated 5 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model ☆42 Updated 11 months ago
- ☆26 Updated 9 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" ☆64 Updated last year
- A Spitting Image: Modular Superpixel Tokenization in Vision Transformers ☆21 Updated last month
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024] ☆14 Updated last year
- 👆Pytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion" ☆27 Updated 8 months ago
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models ☆25 Updated 8 months ago
- [CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models ☆45 Updated last month
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data ☆34 Updated last year
- ☆14 Updated 4 months ago
- Code for the paper "Benchmarking Object Detectors with COCO: A New Path Forward." ☆28 Updated last year