huggingface / movie-shot-categorizer
Fine-tune of Florence-2 for shot categorization.
☆24Updated 2 months ago
Alternatives and similar repositories for movie-shot-categorizer
Users who are interested in movie-shot-categorizer are comparing it to the repositories listed below.
- A minimalistic, hackable code base to finetune Wan video generation model☆39Updated last month
- Minimal Differentiable Image Reward Functions☆56Updated last month
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (arXiv, 2024)☆51Updated 6 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 3 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆19Updated 4 months ago
- Code for full fine-tuning of the Mochi model with FSDP (and CP)☆21Updated last month
- ☆33Updated 7 months ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆56Updated 4 months ago
- ☆28Updated 10 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 6 months ago
- ☆23Updated last year
- ☆84Updated 9 months ago
- [ECCVW 2024] Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models☆29Updated 3 weeks ago
- Modern Stable Diffusion models family - Fluently☆31Updated 11 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆58Updated 2 months ago
- Pytorch implementation of Towards Consistent and Controllable Image Synthesis for Face Editing☆55Updated last month
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated 10 months ago
- Official repository for the paper "VQDM: Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization"☆33Updated 8 months ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆85Updated last year
- ☆24Updated last year
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging☆43Updated last month
- Official Code for: "DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency"☆20Updated last month
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆17Updated 9 months ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆70Updated 6 months ago
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆17Updated 7 months ago
- ☆22Updated 6 months ago
- Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance☆75Updated 4 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆73Updated last month
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆78Updated 11 months ago