TencentARC/TVTS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TencentARC/TVTS)

TencentARC / TVTS

Turning to Video for Transcript Sorting

☆49

Alternatives and similar repositories for TVTS

Users that are interested in TVTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qiulu66 / EgoPlan-Bench2
View on GitHub
☆31Apr 11, 2025Updated last year
gimpong / WWW22-HCQ
View on GitHub
The code for the paper "Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval" (WWW'22, Oral).
☆17Mar 8, 2022Updated 4 years ago
lixiaotong97 / mc-BEiT
View on GitHub
[ECCV 2022] Official pytorch implementation of "mc-BEiT: Multi-choice Discretization for Image BERT Pre-training" in European Conference …
☆22Sep 13, 2022Updated 3 years ago
ChenYi99 / EgoPlan
View on GitHub
[IJCV] EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning
☆85Dec 6, 2024Updated last year
TencentARC / MCQ
View on GitHub
Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).
☆141Jul 20, 2022Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
huangmozhi9527 / ConMH
View on GitHub
[AAAI 2023] Contrastive Masked Autoencoders for Self-Supervised Video Hashing
☆26Jul 4, 2023Updated 3 years ago
TencentARC / ConMIM
View on GitHub
Official codes for ConMIM (ICLR 2023)
☆58Feb 8, 2023Updated 3 years ago
TencentARC / Divot
View on GitHub
Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)
☆87Feb 27, 2025Updated last year
AILab-CVC / SEED
View on GitHub
Official implementation of SEED-LLaMA (ICLR 2024).
☆642Sep 21, 2024Updated last year
gimpong / MM23-MISSRec
View on GitHub
The code for the paper "MISSRec: Pre-training and Transferring Multi-modal Interest-aware Sequence Representation for Recommendation" (AC…
☆63Mar 20, 2024Updated 2 years ago
huangmozhi9527 / GMMFormer
View on GitHub
[AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval
☆21May 10, 2024Updated 2 years ago
zlab-princeton / UEval
View on GitHub
UEval: A Benchmark for Unified Multimodal Generation
☆24Apr 20, 2026Updated 3 months ago
AILab-CVC / SEED-Bench
View on GitHub
(CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.
☆365Jan 14, 2025Updated last year
sail-sg / ptp
View on GitHub
[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》
☆150Jun 7, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
gimpong / AAAI22-MeCoQ
View on GitHub
The code for the paper "Contrastive Quantization with Code Memory for Unsupervised Image Retrieval" (AAAI'22, Oral).
☆37Oct 21, 2022Updated 3 years ago
TencentARC / TaCA
View on GitHub
Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16Jun 20, 2023Updated 3 years ago
HKUST-LongGroup / CoMM
View on GitHub
[CVPR 2025 Highlight] Official repository for CoMM Dataset
☆56Dec 31, 2024Updated last year
wlin-at / MAXI
View on GitHub
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)
☆31Sep 5, 2023Updated 2 years ago
KuofengGao / ASD
View on GitHub
[CVPR 2023] Backdoor Defense via Adaptively Splitting Poisoned Dataset
☆50Apr 8, 2024Updated 2 years ago
gimpong / AAAI25-S5VH
View on GitHub
The code for the paper "Efficient Self-Supervised Video Hashing with Selective State Spaces" (AAAI'25).
☆23Aug 2, 2025Updated 11 months ago
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
actcwlf / GSVC
View on GitHub
An Exploration with Entropy Constrained 3D Gaussians for 2D Video Compression (ICLR2025)
☆18Jul 8, 2025Updated last year
ssppp / Click4Caption
View on GitHub
A visual LLM for image region description or QA.
☆16Jul 14, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
KuofengGao / CIBA
View on GitHub
[BMVC 2023] Backdoor Attack on Hash-based Image Retrieval via Clean-label Data Poisoning
☆17Sep 1, 2023Updated 2 years ago
gimpong / CVPR25-Condenser
View on GitHub
The code for the paper "Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning" (CVPR'25).
☆16Sep 25, 2025Updated 9 months ago
zjr2000 / LLMVA-GEBC
View on GitHub
Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
☆29Jan 1, 2024Updated 2 years ago
EricLee8 / SPACE
View on GitHub
The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension
☆12Oct 23, 2022Updated 3 years ago
MikeWangWZHL / Paxion
View on GitHub
Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight
☆38May 23, 2023Updated 3 years ago
Kwai-YuanQi / TaskGalaxy
View on GitHub
Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types
☆32Jul 16, 2025Updated last year
m-bain / frozen-in-time
View on GitHub
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
☆377May 19, 2022Updated 4 years ago
yale-nlp / TOMATO
View on GitHub
☆41Nov 8, 2024Updated last year
VITA-Group / instant_soup
View on GitHub
[ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…
☆11Nov 28, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
antoyang / VidChapters
View on GitHub
[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
☆211Nov 13, 2023Updated 2 years ago
szzexpoi / POEM
View on GitHub
Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…
☆10Jun 16, 2024Updated 2 years ago
alibaba-mmai-research / DiST
View on GitHub
ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
☆41Sep 25, 2023Updated 2 years ago
GX77 / TextKG
View on GitHub
☆11Jun 27, 2023Updated 3 years ago
JerryXu0129 / HyP2-Loss
View on GitHub
☆14Oct 10, 2022Updated 3 years ago
OpenGVLab / efficient-video-recognition
View on GitHub
☆184Aug 20, 2022Updated 3 years ago
showlab / Awesome-Long-Context
View on GitHub
A curated list of resources about long-context in large-language models and video understanding.
☆32Aug 8, 2023Updated 2 years ago