Fine-tune of Florence-2 for shot categorization.
☆26 · Mar 6, 2025 · Updated last year
Alternatives and similar repositories for movie-shot-categorizer
Users who are interested in movie-shot-categorizer are comparing it to the libraries listed below.
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p… ☆14 · Feb 5, 2024 · Updated 2 years ago
- This is our Final Year Project titled "Implementation of seam carving for image retargeting using CUDA enabled GPU" ☆11 · Nov 16, 2024 · Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark ☆36 · May 7, 2025 · Updated 11 months ago
- ☆23 · Jul 22, 2025 · Updated 8 months ago
- Bagel but with Gradio Interface ☆20 · May 21, 2025 · Updated 10 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025] ☆143 · Jan 27, 2025 · Updated last year
- Collection of scripts to build small-scale datasets for fine-tuning video generation models. ☆80 · Mar 17, 2025 · Updated last year
- ☆17 · Jun 14, 2024 · Updated last year
- ☆83 · Mar 10, 2025 · Updated last year
- UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer ☆124 · Jun 27, 2025 · Updated 9 months ago
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation" ☆22 · Jun 5, 2025 · Updated 10 months ago
- ☆51 · Apr 11, 2025 · Updated last year
- A PyTorch implementation of NormSoftmax based on BMVC 2019 paper "Classification is a Strong Baseline for Deep Metric Learning" ☆10 · Mar 15, 2020 · Updated 6 years ago
- [NeurIPS '25] Multi-Token Prediction Needs Registers ☆28 · Dec 14, 2025 · Updated 4 months ago
- Transform your videos into captivating animations. Processes each frame to create an animation styled with Stable Diffusion. Simply uploa… ☆16 · May 7, 2023 · Updated 2 years ago
- [AAAI'26] Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression ☆19 · Dec 21, 2025 · Updated 3 months ago
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024… ☆67 · Nov 19, 2024 · Updated last year
- ☆25 · Nov 17, 2025 · Updated 5 months ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch ☆12 · Jan 16, 2022 · Updated 4 years ago
- Minimal zero-shot intent classifier for arbitrary intent slot filling, via LLM prompting w LangChain. ☆37 · Mar 13, 2023 · Updated 3 years ago
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation' ☆39 · Dec 30, 2025 · Updated 3 months ago
- The `onnx` Python library (not `onnxruntime`, to be clear) running in the browser using Pyodide. ☆12 · Oct 12, 2023 · Updated 2 years ago
- Official repository for the paper "MICo-150K: A Comprehensive Dataset for Multi-Image Composition". ☆131 · Mar 1, 2026 · Updated last month
- ACM MM 2022 - PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding ☆11 · Aug 12, 2022 · Updated 3 years ago
- ☆34 · Jun 22, 2023 · Updated 2 years ago
- Inference-time scaling of diffusion-based image and video generation models. ☆174 · Dec 17, 2025 · Updated 4 months ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025 ☆13 · Jun 25, 2024 · Updated last year
- Towards Photorealistic 4D Scene Generation via Video Diffusion Models ☆19 · Jun 12, 2024 · Updated last year
- Node to tryoff clothes ☆23 · Apr 14, 2025 · Updated last year
- Open Set Semantic Segmentation ☆10 · Dec 23, 2020 · Updated 5 years ago
- ☆30 · Nov 22, 2022 · Updated 3 years ago
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control ☆192 · Dec 31, 2024 · Updated last year
- ☆55 · Apr 9, 2026 · Updated last week
- ☆11 · Apr 18, 2021 · Updated 5 years ago
- Video2Video Framework for ComfyUI ☆63 · Aug 12, 2024 · Updated last year
- ☆186 · Jun 27, 2025 · Updated 9 months ago
- ☆19 · Apr 23, 2025 · Updated 11 months ago
- Unofficial PyTorch implementation of "Grid Anchor based Image Cropping: A New Benchmark and An Efficient Model" ☆21 · May 13, 2022 · Updated 3 years ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks. ☆17 · Aug 30, 2024 · Updated last year