alibaba-mmai-research/HiCo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alibaba-mmai-research/HiCo)

alibaba-mmai-research / HiCo

CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency

☆18

Alternatives and similar repositories for HiCo

Users that are interested in HiCo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

minjoong507 / Consistency-of-Video-LLM
View on GitHub
[CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"
☆16Oct 13, 2025Updated 9 months ago
naver-ai / class-query-vad
View on GitHub
[ECCV 2024] Official PyTorch implementation of "Classification Matters: Improving Video Action Detection with Class-Specific Attention"
☆18Nov 8, 2024Updated last year
renjie-liang / HUAL
View on GitHub
Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning
☆15Dec 12, 2023Updated 2 years ago
shvdiwnkozbw / Self-supervised-Video-Concept
View on GitHub
Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.
☆11Jul 28, 2022Updated 4 years ago
alibaba-mmai-research / Masked-Action-Recognition
View on GitHub
Official code for the paper: MAR: Masked Autoencoders for Efficient Action Recognition
☆32Dec 7, 2022Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
qirui-chen / MultiHop-EgoQA
View on GitHub
[AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos
☆38May 27, 2025Updated last year
minjoong507 / BM-DETR
View on GitHub
[WACV 2025] Official Pytorch code for "Background-aware Moment Detection for Video Moment Retrieval"
☆16Feb 24, 2025Updated last year
mengcaopku / LocVTP
View on GitHub
[ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization
☆39Jul 29, 2022Updated 4 years ago
mbzuai-oryx / LongShOT
View on GitHub
A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos
☆21Jun 20, 2026Updated last month
Siddhantmest / Facial-Action-Unit-Detection
View on GitHub
Predicting FAU intensities to determine the type of emotion.
☆15Apr 17, 2021Updated 5 years ago
alibaba-mmai-research / DiST
View on GitHub
ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
☆41Sep 25, 2023Updated 2 years ago
Kangningthu / SUM
View on GitHub
Uncertainty-aware Fine-tuning of Segmentation Foundation Models (NeurIPS 2024).
☆16Jan 9, 2025Updated last year
BoPang1996 / PGT
View on GitHub
Official code of paper "PGT: A Progressive Method for Training Models on Long Videos" on CVPR2021
☆30Mar 30, 2021Updated 5 years ago
NJU-LINK / IF-VidCap
View on GitHub
The Source Code for IF-VidCap @ICLR 2026
☆19Oct 22, 2025Updated 9 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Mark12Ding / FAME
View on GitHub
[CVPR 2022] Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging
☆51Sep 30, 2023Updated 2 years ago
franciszzj / VLPrompt
View on GitHub
[IJCV 2025] VLPrompt-PSG: Vision-Language Prompting for Panoptic Scene Graph Generation
☆28Sep 24, 2024Updated last year
KHU-VLL / DEVIAS
View on GitHub
[ECCV 2024 Oral] Official implementation of the paper "DEVIAS: Learning Disentangled Video Representations of Action and Scene"
☆29Nov 15, 2025Updated 8 months ago
Chuhanxx / Temporal_Query_Networks
View on GitHub
The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding
☆64Mar 9, 2022Updated 4 years ago
JamesLiang819 / Instance_Unique_Querying
View on GitHub
[NeurIPS 2022 Spotlight] Learning Equivariant Segmentation with Instance-Unique Querying
☆22Dec 17, 2022Updated 3 years ago
laura-wang / video_repres_sts
View on GitHub
Pytorch implementation of our T-PAMI 2021 paper: Self-supervised Video Representation Learning by Uncovering Motion and Appearance Stati…
☆50Feb 9, 2021Updated 5 years ago
Yui010206 / Ego2Web
View on GitHub
[CVPR 2026] Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos
☆29Mar 25, 2026Updated 4 months ago
mayu-ot / hidden-challenges-MR
View on GitHub
codes for Uncovering Hidden Challenges in Query-Based Video Moment Retrieval
☆20Sep 7, 2020Updated 5 years ago
afcedf / SOONet
View on GitHub
Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos
☆30Jun 24, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
AIM3-RUC / VideoIC
View on GitHub
Danmuku dataset
☆12Jul 7, 2023Updated 3 years ago
gurkirt / corrected-UCF101-Annots
View on GitHub
☆83Feb 20, 2021Updated 5 years ago
TengdaHan / TemporalAlignNet
View on GitHub
[CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.
☆122Oct 9, 2023Updated 2 years ago
zhang-can / UP-TAL
View on GitHub
[CVPR2022] Unsupervised Pre-training for Temporal Action Localization Tasks (UP-TAL)
☆29Mar 9, 2022Updated 4 years ago
YYJMJC / Compositional-Temporal-Grounding
View on GitHub
☆31Mar 24, 2022Updated 4 years ago
merlresearch / SMART
View on GitHub
Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?"
☆11Aug 10, 2023Updated 2 years ago
pritamqu / CrissCross
View on GitHub
[AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity
☆26Jul 11, 2023Updated 3 years ago
nuggy875 / NeRF_pytorch_paeng
View on GitHub
Reimplementation of NeRF (Neural Radiance Fields) (ECCV2020)
☆10May 4, 2023Updated 3 years ago
google-research-datasets / maverics
View on GitHub
MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering…
☆13Feb 18, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
NVlabs / FRAG
View on GitHub
☆15Apr 25, 2025Updated last year
HumamAlwassel / TSP
View on GitHub
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks (ICCVW 2021)
☆119Sep 16, 2023Updated 2 years ago
huiwon-jang / CoordTok
View on GitHub
☆38Feb 6, 2025Updated last year
frostinassiky / bsp
View on GitHub
Placeholder for code of BSP.
☆11Aug 13, 2021Updated 4 years ago
yytzsy / grounding_changing_distribution
View on GitHub
☆36Apr 14, 2021Updated 5 years ago
lambert-x / video-semisup
View on GitHub
Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022)
☆30Dec 1, 2022Updated 3 years ago
Pilhyeon / BAM-DETR
View on GitHub
Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'
☆36Feb 26, 2025Updated last year