☆34 · Jun 2, 2023 · Updated 2 years ago
Alternatives and similar repositories for TranS4mer
Users that are interested in TranS4mer are comparing it to the libraries listed below
- ☆139 · Jan 3, 2024 · Updated 2 years ago
- This is an official PyTorch Implementation of Neighbor Relations Matter in Video Scene Detection. ☆28 · Mar 19, 2025 · Updated 11 months ago
- ☆58 · Dec 2, 2025 · Updated 3 months ago
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation" ☆105 · Feb 14, 2023 · Updated 3 years ago
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information ☆15 · Oct 27, 2024 · Updated last year
- Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation ☆234 · May 20, 2024 · Updated last year
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection ☆38 · Aug 29, 2023 · Updated 2 years ago
- Quick Long Video Understanding [TMLR2025] ☆76 · Oct 27, 2025 · Updated 4 months ago
- ☆21 · Mar 22, 2023 · Updated 2 years ago
- Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies" ☆18 · Mar 21, 2023 · Updated 2 years ago
- TransNet V2: Shot Boundary Detection Neural Network ☆880 · Dec 4, 2023 · Updated 2 years ago
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs. ☆29 · Oct 18, 2024 · Updated last year
- Clipora is a powerful toolkit for fine-tuning OpenCLIP models using Low Rank Adapters (LoRA). ☆24 · Aug 15, 2024 · Updated last year
- [IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning ☆24 · Dec 19, 2023 · Updated 2 years ago
- Video shot transition detection ☆25 · Mar 9, 2023 · Updated 3 years ago
- Character-aware audio-only subtitling ☆31 · Jun 15, 2025 · Updated 8 months ago
- ☆21 · May 11, 2025 · Updated 9 months ago
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990) ☆61 · Aug 17, 2021 · Updated 4 years ago
- Repo for active speaker detection for media videos. ☆31 · Nov 19, 2023 · Updated 2 years ago
- A fullstack Rust + React chat app using open-source Llama language models ☆33 · Sep 8, 2023 · Updated 2 years ago
- Context Free Grammar (CFG) parser library and application written in Python. ☆27 · Nov 22, 2023 · Updated 2 years ago
- (WACV 2021) Temporal Context Aggregation for Video Retrieval with Contrastive Learning ☆29 · Aug 4, 2021 · Updated 4 years ago
- Official PyTorch implementation of "DiGA: Distil to Generalize and then Adapt for Domain Adaptive Semantic Segmentation" (CVPR 2023) ☆29 · Apr 1, 2024 · Updated last year
- ☆82 · Mar 10, 2025 · Updated 11 months ago
- A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios ☆13 · Jan 24, 2024 · Updated 2 years ago
- ☆13 · Nov 21, 2025 · Updated 3 months ago
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition ☆38 · Apr 27, 2024 · Updated last year
- A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions. ☆168 · Jan 30, 2025 · Updated last year
- Unofficial x64 GLIBC 2.17 binaries for Node.js ☆13 · Jun 21, 2023 · Updated 2 years ago
- Implementation of "Look, Listen and Recognise: character-aware audio-visual subtitling" ☆19 · Nov 3, 2025 · Updated 4 months ago
- ☆13 · May 17, 2025 · Updated 9 months ago
- A list of (detailed, non-stochastic) action potential models, with links to papers, source code, CellML and Myokit implementations ☆11 · Feb 24, 2026 · Updated last week
- This repository contains the code for our paper "Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguo… ☆42 · Apr 25, 2023 · Updated 2 years ago
- SKFAC Preconditioner for MindSpore ☆12 · Jul 2, 2021 · Updated 4 years ago
- Official code for PLoP ☆17 · Updated this week
- Retrieval Augmented Generation, but no servers involved. Backed by S3 ☆12 · Nov 3, 2023 · Updated 2 years ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model ☆15 · Jul 31, 2025 · Updated 7 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆46 · Mar 29, 2024 · Updated last year
- Official implementation for “SafeMVDrive: Multi-view Safety-Critical Driving Video Synthesis in the Real World Domain” ☆21 · Dec 11, 2025 · Updated 2 months ago