md-mohaiminul/TranS4mer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/md-mohaiminul/TranS4mer)

md-mohaiminul / TranS4mer

☆34

Alternatives and similar repositories for TranS4mer

Users that are interested in TranS4mer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kakaobrain / bassl
View on GitHub
☆142Jan 3, 2024Updated 2 years ago
ExMorgan-Alter / NeighborNet
View on GitHub
This is an official PyTorch Implementation of Neighbor Relations Matter in Video Scene Detection.
☆29Mar 19, 2025Updated last year
md-mohaiminul / ViS4mer
View on GitHub
☆58Dec 2, 2025Updated 7 months ago
Annusha / LIReC
View on GitHub
Learning Interactions and Relationships between Movie Characters (CVPR'20)
☆22Apr 12, 2023Updated 3 years ago
TencentYoutuResearch / SceneSegmentation-SCRL
View on GitHub
Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"
☆112Feb 14, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
anyirao / SceneSeg
View on GitHub
Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
☆237May 20, 2024Updated 2 years ago
patrick-tssn / VSTAR
View on GitHub
[ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information
☆16Oct 27, 2024Updated last year
MCG-NJU / TemporalPerceiver
View on GitHub
[T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
☆39Aug 29, 2023Updated 2 years ago
kampta / PatchGame
View on GitHub
PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021
☆24Jun 4, 2021Updated 5 years ago
NewsNet-Benchmark / NewsNet
View on GitHub
☆21Mar 22, 2023Updated 3 years ago
FeiT-FeiTeng / OAFuser
View on GitHub
☆10Sep 3, 2024Updated last year
Gorilla-Lab-SCUT / OrthDNNs
View on GitHub
Code for OrthDNNs: Orthogonal Deep Neural Networks
☆14Jan 9, 2020Updated 6 years ago
Skyline-9 / Visionary-Vids
View on GitHub
Multi-modal transformer approach for natural language query based joint video summarization and highlight detection
☆17May 23, 2024Updated 2 years ago
soCzech / TransNetV2
View on GitHub
TransNet V2: Shot Boundary Detection Neural Network
☆1,003Dec 4, 2023Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
fy-vision / DiGA
View on GitHub
Official PyTorch implementation of "DiGA: Distil to Generalize and then Adapt for Domain Adaptive Semantic Segmentation" (CVPR 2023)
☆29Apr 1, 2024Updated 2 years ago
TencentYoutuResearch / HighlightDetection-CLC
View on GitHub
Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"
☆18Mar 21, 2023Updated 3 years ago
bytedance / Shot2Story
View on GitHub
A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.
☆179Jan 30, 2025Updated last year
JaesungHuh / simple-subtitling
View on GitHub
Character-aware audio-only subtitling
☆31Jun 15, 2025Updated last year
JunweiZheng93 / MATERobot
View on GitHub
Official repository for paper "MATERobot: Material Recognition in Wearable Robotics for People with Visual Impairments", ICRA 2024, Best …
☆16Mar 26, 2025Updated last year
KPeng9510 / RelaMiX
View on GitHub
☆19Aug 13, 2024Updated last year
substratusai / helm
View on GitHub
☆18Aug 19, 2024Updated last year
sming256 / AdaTAD
View on GitHub
[CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
☆42Jul 9, 2024Updated 2 years ago
KPeng9510 / OS-SAR
View on GitHub
☆16May 14, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
springkim / WSpring
View on GitHub
windows setup script
☆11Jan 22, 2023Updated 3 years ago
Blank-Wang / DCASE2018-Task4
View on GitHub
Weakly Supervised CRNN System for Sound Event Detection With Large-scale Unlabeled In-domain Data
☆11Oct 31, 2018Updated 7 years ago
mysee1989 / GraphJigsaw
View on GitHub
Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition
☆10Jul 1, 2022Updated 4 years ago
rash1993 / movie-asd
View on GitHub
repo for active speaker detection for media videos.
☆31Nov 19, 2023Updated 2 years ago
erosenfeld / disagree_discrep
View on GitHub
Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.
☆10Feb 27, 2024Updated 2 years ago
danilhendrasr / video-decoding-benchmark
View on GitHub
Compare NVIDIA Video Codec SDK's, PyAV's, and OpenCV's performance on video decoding.
☆13Dec 18, 2022Updated 3 years ago
3DCoMPaT200 / 3DCoMPaT200
View on GitHub
☆15Feb 13, 2025Updated last year
usc-sail / mica-MovieCLIP
View on GitHub
This repository contains the codebase for MovieCLIP: Visual Scene Recognition in Movies
☆43Oct 1, 2023Updated 2 years ago
kampta / PatchVAE
View on GitHub
PyTorch implementation of "PatchVAE: Learning Local Latent Codes for Recognition" to appear in CVPR 2020
☆14Apr 9, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
InuyashaYang / AIDIY
View on GitHub
JoinAI是一个开源仓库，专注于算法工程能力的培养，包括工程和数学原理的整理
☆11Apr 20, 2025Updated last year
JaesungHuh / av-diarization
View on GitHub
Audio-visual diarization pipeline used for creating VoxConverse dataset
☆22Jun 6, 2025Updated last year
facebookresearch / DVDialogues
View on GitHub
Code for DVD A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue
☆14Oct 12, 2021Updated 4 years ago
ArtemisWang / blind_movies
View on GitHub
为视障人群生成电影，输入是电影剧本和mkv格式电影，输出为带有解说的电影
☆12Jul 28, 2019Updated 7 years ago
AidanCooper / constrained-decoding
View on GitHub
A guide to structured generation using constrained decoding
☆18Jun 9, 2024Updated 2 years ago
wdzhao123 / FBNet
View on GitHub
FBNet code for FGOC in aerial images
☆15Jun 9, 2022Updated 4 years ago
yingutk / u2MDN
View on GitHub
Demo code for 'Unsupervised and Unregistered Hyperspectral Image Super-Resolution with Mutual Dirichlet-Net'.
☆13Mar 14, 2023Updated 3 years ago