twelvelabs-io/video-embeddings-evaluation-framework

PyTorch implementation of Twelve Labs' Video Foundation Model evaluation framework & open embeddings

☆36

Alternatives and similar repositories for video-embeddings-evaluation-framework

Users who are interested in video-embeddings-evaluation-framework are comparing it to the repositories listed below.

HyeongminLEE / Tensorflow_Pix2Pix
Study Friendly Implementation of Pix2Pix in Tensorflow
☆13 · Sep 8, 2018 · Updated 7 years ago
twelvelabs-io / pegasus-1-eval
Repository for evaluating Pegasus-1 and video-language foundation models
☆14 · Nov 12, 2024 · Updated last year
sjpark5800 / LA-DETR
[WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval
☆14 · Sep 18, 2025 · Updated 10 months ago
FriedRonaldo / PsyNet
Official Implementation of "PsyNet: Self-supervised Approach to Object Localization Using Point Symmetric Transformation"
☆25 · Dec 8, 2022 · Updated 3 years ago
FriedRonaldo / Primitives-PS
Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data - Official PyTorch Implementati…
☆34 · Nov 14, 2022 · Updated 3 years ago
orrzohar / LOVM
[NeurIPS 2023] Official PyTorch code for LOVM: Language-Only Vision Model Selection
☆21 · Feb 3, 2024 · Updated 2 years ago
Hypnosx / Kinetics-TPS
ICCV DeeperAction Challenge - Kinetics-TPS Challenge on Part-level Action Parsing and Action Recognition.
☆13 · Jun 4, 2021 · Updated 5 years ago
wlin-at / MAXI
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)
☆31 · Sep 5, 2023 · Updated 2 years ago
kangreen0210 / LIME
Accelerating the development of large multimodal models (LMMs) with lmms-eval
☆14 · Oct 14, 2024 · Updated last year
markendo / downscaling_intelligence
Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models
☆25 · Mar 21, 2026 · Updated 4 months ago
philippe-eecs / vitok
☆34 · May 14, 2025 · Updated last year
Wolfda95 / MIRP_Benchmark
MICCAI 25 Publication: Your other Left! Vision-Language Models Fail to Identify Relative Positions in Medical Images
☆15 · May 11, 2026 · Updated 2 months ago
zehanwang01 / OmniBind
☆34 · Apr 11, 2025 · Updated last year
yeung-lab / Micro-Bench
A Vision-Language Benchmark for Microscopy Understanding
☆31 · Mar 13, 2025 · Updated last year
DCDmllm / Momentor
☆81 · Nov 24, 2024 · Updated last year
Ziyang412 / Video-RTS
Code for EMNLP25 paper "Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning"
☆24 · Feb 18, 2026 · Updated 5 months ago
xxyzll / UMB
UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)
☆12 · May 26, 2024 · Updated 2 years ago
amitakamath / vl_text_encoders_are_bottlenecks
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11 · May 24, 2023 · Updated 3 years ago
ruiwang2021 / mvd
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…
☆135 · May 21, 2023 · Updated 3 years ago
Hon-Wong / Elysium
[ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM
☆88 · Oct 25, 2024 · Updated last year
RAIVNLab / CREPE
[CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?
☆35 · Apr 27, 2023 · Updated 3 years ago
baidut / PatchVQ
Patch-VQ: ‘Patching Up’ the Video Quality Problem
☆75 · May 7, 2026 · Updated 2 months ago
yashbhalgat / Contrastive-Lift
[NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion"
☆73 · Nov 3, 2023 · Updated 2 years ago
ninatu / howtocaption
Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024
☆58 · Aug 19, 2025 · Updated 11 months ago
arunbalajeev / gaze-interface
Web Interface for gaze recording: CVPR 2018
☆10 · Jul 10, 2018 · Updated 8 years ago
sniklaus / arxiv-doom
a parody of the ever-increasing amount of papers that appear on arXiv
☆38 · May 31, 2026 · Updated last month
fL0n9 / SKFAC-MindSpore
SKFAC Preconditioner for MindSpore
☆12 · Jul 2, 2021 · Updated 5 years ago
SMILE-data / SMILE
SMILE: A Multimodal Dataset for Understanding Laughter
☆13 · Jun 15, 2023 · Updated 3 years ago
naver-ai / dual-teacher
Official code for the NeurIPS 2023 paper "Switching Temporary Teachers for Semi-Supervised Semantic Segmentation"
☆53 · Nov 16, 2023 · Updated 2 years ago
bpiyush / rotation-equivariant-lfm
Rotation equivariance meets local feature matching
☆18 · Oct 20, 2022 · Updated 3 years ago
llyx97 / TempCompass
[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, …
☆133 · Apr 4, 2025 · Updated last year
huggingface / docmatix
A huge dataset for Document Visual Question Answering
☆24 · Jul 29, 2024 · Updated last year
shuheikurita / RefEgo
☆13 · Jul 20, 2024 · Updated 2 years ago
tttyuntian / vlm_lexical_grounding
PyTorch code for the Findings of EMNLP 2021 paper "Does Vision-and-Language Pretraining Improve Lexical Grounding?"
☆11 · Sep 26, 2021 · Updated 4 years ago
won-bae / rethinkingCAM
Official implementation of Rethinking Class Activation Mapping for Weakly Supervised Object Localization (ECCV 2020)
☆22 · Mar 11, 2021 · Updated 5 years ago
OpenMask3D / openmask3d.github.io
☆11 · May 8, 2024 · Updated 2 years ago
StanfordVL / atp-video-language
Official repo for CVPR 2022 (Oral) paper: Revisiting the "Video" in Video-Language Understanding. Contains code for the Atemporal Probe (…
☆51 · May 29, 2024 · Updated 2 years ago
csliujw / swin-upernet
☆17 · Dec 23, 2021 · Updated 4 years ago
aszala / VPEval
VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆45 · Nov 29, 2023 · Updated 2 years ago