huggingface / movie-shot-categorizerView external linksLinks
Fine-tune of Florence-2 for shot categorization.
☆26Mar 6, 2025Updated 11 months ago
Alternatives and similar repositories for movie-shot-categorizer
Users that are interested in movie-shot-categorizer are comparing it to the libraries listed below
Sorting:
- This is our Final Year Project titled " Implementation of seam carving for image retargeting using CUDA enabled GPU"☆11Nov 16, 2024Updated last year
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 8 months ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Feb 5, 2024Updated 2 years ago
- Official repository for the paper "MICo-150K: A Comprehensive Dataset for Multi-Image Composition".☆52Dec 16, 2025Updated 2 months ago
- CogView2 for GPUs with 12/16/24GB vRAM☆16Jun 24, 2022Updated 3 years ago
- profile tools for pytorch nn models☆42Jan 11, 2021Updated 5 years ago
- ☆82Mar 10, 2025Updated 11 months ago
- Unofficial implementation for SOLO instance segmentation☆25Mar 29, 2020Updated 5 years ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆35May 7, 2025Updated 9 months ago
- Public dataset for paper "Bootstrapping Complete The Look at Pinterest"☆34Jun 23, 2020Updated 5 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- ☆30Nov 22, 2022Updated 3 years ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]☆141Jan 27, 2025Updated last year
- The official repository of DreamMover☆34Sep 20, 2024Updated last year
- ☆34Jun 22, 2023Updated 2 years ago
- Minimal zero-shot intent classifier for arbitrary intent slot filling, via LLM prompting w LangChain.☆37Mar 13, 2023Updated 2 years ago
- User-friendly viewer for Parquet files☆10Jan 10, 2026Updated last month
- A complete pipeline for fine-tuning YOLOv8 pose models with custom datasets. Supports automatic and semi-automatic annotation for efficie…☆15Feb 9, 2025Updated last year
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022☆35Aug 4, 2023Updated 2 years ago
- Python library for the enigma machine☆16Mar 7, 2024Updated last year
- ☆13May 8, 2024Updated last year
- Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’☆20Dec 19, 2025Updated last month
- assembles images in a grid☆34Jan 30, 2020Updated 6 years ago
- Inference-time scaling of diffusion-based image and video generation models.☆172Dec 17, 2025Updated 2 months ago
- ☆16Jun 12, 2025Updated 8 months ago
- ☆11Aug 11, 2023Updated 2 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- ☆10Oct 27, 2023Updated 2 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆10May 5, 2020Updated 5 years ago
- Open Set Semantic Segmentation☆10Dec 23, 2020Updated 5 years ago
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆12Dec 31, 2024Updated last year
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- TensorFlow code for our ICCV 2019 paper "Multimodal Style Transfer via Graph Cuts"☆41Dec 20, 2019Updated 6 years ago
- [SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing☆26Nov 14, 2025Updated 3 months ago
- FlexiTokens☆18Dec 27, 2025Updated last month
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- 为visinger SVS系统写的展示系统~本质仍然是个音乐播放器☆11Apr 18, 2023Updated 2 years ago
- Efficient Feature Extraction for High-resolution Video Frame Interpolation (BMVC 2022)☆13Aug 24, 2023Updated 2 years ago