huggingface / movie-shot-categorizer
Fine-tune of Florence-2 for shot categorization.
☆24Updated 2 months ago
Alternatives and similar repositories for movie-shot-categorizer
Users that are interested in movie-shot-categorizer are comparing it to the libraries listed below
Sorting:
- A minimalistic, hackable code base to finetune Wan video generation model☆39Updated 3 weeks ago
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (arXiv, 2024)☆50Updated 5 months ago
- ☆28Updated 9 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 3 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆19Updated 3 months ago
- Modern Stable Diffusion models family - Fluently☆31Updated 11 months ago
- ☆83Updated 8 months ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆54Updated 3 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models.☆56Updated last month
- [ECCVW 2024] Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models☆29Updated this week
- ☆33Updated 6 months ago
- Code for full fintuing Mochi model with FSDP (and CP)☆17Updated 3 weeks ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆68Updated 5 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆34Updated last year
- Minimal Differentiable Image Reward Functions☆55Updated last month
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆17Updated 8 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 5 months ago
- Pytorch implementation of Towards Consistent and Controllable Image Synthesis for Face Editing☆54Updated 2 weeks ago
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆55Updated 4 months ago
- An official implementation of SwapAnyone.☆59Updated 2 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated 9 months ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Updated 6 months ago
- ☆24Updated last year
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆17Updated 7 months ago
- Animatediff implementation. Includes a ControlNet pipeline.☆19Updated last year
- ☆46Updated 5 months ago
- Official Code for: "DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency"☆19Updated 2 weeks ago
- Public code release for the paper "ProCreate, Don’t Reproduce! Propulsive Energy Diffusion for Creative Generation"☆38Updated last week
- Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance☆74Updated 3 months ago
- DiT for VAE (and Video Generation)☆32Updated 8 months ago