This repository contains the codebase for MovieCLIP: Visual Scene Recognition in Movies
☆42Oct 1, 2023Updated 2 years ago
Alternatives and similar repositories for mica-MovieCLIP
Users that are interested in mica-MovieCLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Condensed Movies Challenge 2021☆20Sep 21, 2022Updated 3 years ago
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆16Oct 27, 2024Updated last year
- A dataset with classified film shots☆11Aug 8, 2022Updated 3 years ago
- StyleGAN - Official TensorFlow Implementation☆14Sep 13, 2019Updated 6 years ago
- ☆22Feb 25, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering☆13Aug 22, 2023Updated 2 years ago
- Video dataset and code for transforming a video's aspect ratio, from our papers "A fast smart-cropping method and dataset for video retar…☆29Jun 7, 2022Updated 3 years ago
- Roof segmentation in satellite images☆13Mar 25, 2023Updated 3 years ago
- A pipeline focused on the in-painting of text in images. For example the removal of subtitles in a screenshot of a movie.☆16Jun 30, 2022Updated 3 years ago
- The source code of ExFunTube☆10Aug 8, 2025Updated 9 months ago
- Harmonic-NAS: Hardware-Aware Multimodal Neural Architecture Search on Resource-constrained Devices (ACML 2023)☆16May 7, 2024Updated 2 years ago
- Both audio-only and audio-visual speaker diarization datasets are listed here.☆15Feb 22, 2023Updated 3 years ago
- A rework of the gradio WebUI for the open-source unified multimodal model by ByteDance☆21Jun 3, 2025Updated 11 months ago
- The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)☆172Mar 23, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 基于Dolphin模型的东方语言音视频转字幕api及webui☆18Apr 3, 2025Updated last year
- Vapoursynth filter using ProPainter: Improving Propagation and Transformer for Video Inpainting☆18Mar 26, 2026Updated last month
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Sep 1, 2023Updated 2 years ago
- ☆10Nov 27, 2024Updated last year
- Blender导入小二预设材质、节点组☆33Feb 26, 2026Updated 2 months ago
- ☆15Jan 9, 2024Updated 2 years ago
- [CVPR'23 Highlight] AutoAD: Movie Description in Context.☆102Nov 6, 2024Updated last year
- Unity3D shader workshop for GLASS summer school☆15Jun 15, 2016Updated 9 years ago
- ☆21Mar 6, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Video shot transition detection☆26Mar 9, 2023Updated 3 years ago
- ☆23Jan 31, 2025Updated last year
- [WACV 2026] LASER: Lip Landmark Assisted Speaker Detection for Robustness official implemntation☆26Feb 26, 2026Updated 2 months ago
- ☆10Sep 20, 2021Updated 4 years ago
- Marching Squares implementation for Processing based on https://github.com/murphydactyl/JavaKinectFingerTracker/☆13Sep 10, 2012Updated 13 years ago
- Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation☆32Dec 10, 2025Updated 5 months ago
- 🔥 [CVPR 2024] The official repo for Zero-Painter!☆70Jun 8, 2024Updated last year
- chatterbox TTS + Voice Clone using onnx☆28Dec 31, 2025Updated 4 months ago
- A WGAN-GP that utilizes a compositional pattern producing network as the generator☆11Sep 9, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Google Colab notebook to convert your low-resolution StyleGAN models into higher-resolution checkpoints so you can continue training!☆38Sep 26, 2019Updated 6 years ago
- ☆13Feb 23, 2018Updated 8 years ago
- Some mathematical extensions to SuperCollider☆16Jul 19, 2023Updated 2 years ago
- The official implementation of our ECCV 2024 publication, PYRA (Parallel Yielding Re-Activation).☆22Dec 19, 2025Updated 4 months ago
- Instance segmentation deep learning model built using PyTorch and fastai to classify rooftops and calculate their solar PV potential.☆15Jun 14, 2024Updated last year
- ☆32May 3, 2024Updated 2 years ago
- Official code for Tell Me What You See: A Zero-Shot Action Recognition Method Based on Natural Language Descriptions (Multimedia Tools an…☆13Mar 8, 2024Updated 2 years ago