Fine-tune of Florence-2 for shot categorization.
☆26 · Mar 6, 2025 · Updated last year
Alternatives and similar repositories for movie-shot-categorizer
Users who are interested in movie-shot-categorizer are comparing it to the libraries listed below.
- This is our Final Year Project titled "Implementation of seam carving for image retargeting using CUDA enabled GPU" ☆11 · Nov 16, 2024 · Updated last year
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings ☆13 · May 22, 2025 · Updated 9 months ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p… ☆14 · Feb 5, 2024 · Updated 2 years ago
- CogView2 for GPUs with 12/16/24GB vRAM ☆16 · Jun 24, 2022 · Updated 3 years ago
- Official repository for the paper "MICo-150K: A Comprehensive Dataset for Multi-Image Composition". ☆54 · Mar 1, 2026 · Updated last week
- profile tools for pytorch nn models ☆42 · Jan 11, 2021 · Updated 5 years ago
- ☆82 · Mar 10, 2025 · Updated 11 months ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models. ☆80 · Mar 17, 2025 · Updated 11 months ago
- ☆31 · Dec 20, 2022 · Updated 3 years ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark ☆35 · May 7, 2025 · Updated 10 months ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023. ☆11 · Oct 23, 2023 · Updated 2 years ago
- ☆30 · Nov 22, 2022 · Updated 3 years ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025] ☆141 · Jan 27, 2025 · Updated last year
- The official repository of DreamMover ☆34 · Sep 20, 2024 · Updated last year
- ☆34 · Jun 22, 2023 · Updated 2 years ago
- Minimal zero-shot intent classifier for arbitrary intent slot filling, via LLM prompting w LangChain. ☆37 · Mar 13, 2023 · Updated 2 years ago
- Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’ ☆21 · Dec 19, 2025 · Updated 2 months ago
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022 ☆35 · Aug 4, 2023 · Updated 2 years ago
- A complete pipeline for fine-tuning YOLOv8 pose models with custom datasets. Supports automatic and semi-automatic annotation for efficie… ☆15 · Feb 9, 2025 · Updated last year
- Python library for the enigma machine ☆16 · Mar 7, 2024 · Updated 2 years ago
- assembles images in a grid ☆34 · Jan 30, 2020 · Updated 6 years ago
- Inference-time scaling of diffusion-based image and video generation models. ☆172 · Dec 17, 2025 · Updated 2 months ago
- ☆51 · Apr 11, 2025 · Updated 10 months ago
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection" ☆10 · May 5, 2020 · Updated 5 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion ☆49 · Mar 7, 2023 · Updated 3 years ago
- ☆11 · Aug 11, 2023 · Updated 2 years ago
- ☆10 · Oct 27, 2023 · Updated 2 years ago
- ☆16 · Jun 12, 2025 · Updated 8 months ago
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate… ☆13 · Dec 31, 2024 · Updated last year
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" … ☆10 · Nov 6, 2024 · Updated last year
- Towards Photorealistic 4D Scene Generation via Video Diffusion Models ☆20 · Jun 12, 2024 · Updated last year
- Open Set Semantic Segmentation ☆10 · Dec 23, 2020 · Updated 5 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers. ☆12 · Nov 7, 2024 · Updated last year
- Resources for "Simple Speech Representation Learning from Perceptual Data". ☆11 · Sep 18, 2023 · Updated 2 years ago
- The `onnx` Python library (not `onnxruntime`, to be clear) running in the browser using Pyodide. ☆12 · Oct 12, 2023 · Updated 2 years ago
- TensorFlow code for our ICCV 2019 paper "Multimodal Style Transfer via Graph Cuts" ☆41 · Dec 20, 2019 · Updated 6 years ago
- ☆11 · Oct 29, 2022 · Updated 3 years ago
- 3rd party dependencies for DALI project ☆11 · Updated this week
- Detects shot boundaries from news with K-Means. Using Bhattacharya Coefficient for distance. ☆10 · Jun 1, 2017 · Updated 8 years ago