☆34Jun 2, 2023Updated 2 years ago
Alternatives and similar repositories for TranS4mer
Users that are interested in TranS4mer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆139Jan 3, 2024Updated 2 years ago
- This is an official PyTorch Implementation of Neighbor Relations Matter in Video Scene Detection.☆28Mar 19, 2025Updated last year
- ☆58Dec 2, 2025Updated 3 months ago
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"☆106Feb 14, 2023Updated 3 years ago
- Quick Long Video Understanding [TMLR2025]☆76Oct 27, 2025Updated 5 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- TransNet V2: Shot Boundary Detection Neural Network☆899Dec 4, 2023Updated 2 years ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆39Aug 29, 2023Updated 2 years ago
- ☆21Mar 22, 2023Updated 3 years ago
- A guide to structured generation using constrained decoding☆14Jun 9, 2024Updated last year
- PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021☆24Jun 4, 2021Updated 4 years ago
- [AAAI-24] VVS : Video-to-Video Retrieval With Irrelevant Frame Suppression☆20May 14, 2024Updated last year
- ☆24Sep 24, 2023Updated 2 years ago
- Clipora is a powerful toolkit for fine-tuning OpenCLIP models using Low Rank Adapters (LoRA).☆24Aug 15, 2024Updated last year
- [ICML 2025] Repository for M3-JEPA: Multimodal Alignment via Multi-gate MoE based on the Joint-Predictive Embedding Architecture☆23Mar 13, 2026Updated 2 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"☆18Mar 21, 2023Updated 3 years ago
- Official PyTorch implementation of "DiGA: Distil to Generalize and then Adapt for Domain Adaptive Semantic Segmentation" (CVPR 2023)☆29Apr 1, 2024Updated last year
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)☆61Aug 17, 2021Updated 4 years ago
- Character-aware audio-only subtitling☆31Jun 15, 2025Updated 9 months ago
- Multi-modal transformer approach for natural language query based joint video summarization and highlight detection☆17May 23, 2024Updated last year
- [CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames☆41Jul 9, 2024Updated last year
- (WACV 2021) Temporal Context Aggregation for Video Retrieval with Contrastive Learning☆29Aug 4, 2021Updated 4 years ago
- ☆18Aug 19, 2024Updated last year
- [IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning☆24Dec 19, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition☆10Jul 1, 2022Updated 3 years ago
- This repository contains the codebase for MovieCLIP: Visual Scene Recognition in Movies☆42Oct 1, 2023Updated 2 years ago
- Repo from the "Learning with limited labeled data" seminar @ Uni of Tuebingen. A collection of notes, notebooks and slideshows to underst…☆17Apr 13, 2023Updated 2 years ago
- 为视障人群生成电影,输入是电影剧本和mkv格式电影,输出为带有解说的电影☆12Jul 28, 2019Updated 6 years ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Feb 27, 2024Updated 2 years ago
- Compare NVIDIA Video Codec SDK's, PyAV's, and OpenCV's performance on video decoding.☆12Dec 18, 2022Updated 3 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 9 months ago
- PyTorch implementation of "PatchVAE: Learning Local Latent Codes for Recognition" to appear in CVPR 2020☆14Apr 9, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue☆14Oct 12, 2021Updated 4 years ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆15Jul 31, 2025Updated 7 months ago
- ☆14Sep 11, 2025Updated 6 months ago
- [ICIP2023] Code for the paper 'Action Anticipation with Goal Consistency'☆12Apr 5, 2024Updated last year
- The implementation code of AAAI 2020 paper "Pixel-aware Deep Function-mixture Network for Spectral Super-Resolution".