yeliudev / nncoreLinks

📦 A lightweight machine learning toolkit for researchers, providing common model design & learning functionalities.

☆28

Alternatives and similar repositories for nncore

Users that are interested in nncore are comparing it to the libraries listed below

Sorting:

mengcaopku / LocVTP
[ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization
☆39Updated 2 years ago
NeverMoreLCH / Awesome-Video-Grounding
A reading list of papers about Visual Grounding.
☆31Updated 2 years ago
mzhaoshuai / CenterCLIP
[SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based on CLIP + fast p…
☆132Updated 3 years ago
LiuRicky / ts2_net
[ECCV2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval
☆78Updated 2 years ago
jinhyunj / EaTR
Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)
☆50Updated last year
jingwangsg / MS-DETR
An official implementation for MS-DETR in ACL'23
☆17Updated 2 years ago
ju-chen / Efficient-Prompt
☆193Updated 2 years ago
antoyang / TubeDETR
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
☆181Updated last year
j-min / HiREST
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
☆102Updated 5 months ago
jy0205 / STCAT
[NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding
☆52Updated last year
showlab / all-in-one
[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
☆283Updated 2 years ago
linjieli222 / HERO_Video_Feature_Extractor
Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"
☆110Updated 4 years ago
ZhenZHAO / awesome-video-moment-retrieval
paper list on Video Moment Retrieval (VMR), or Natural Language Video Localization (NLVL), or Temporal Sentence Grounding in Videos (TSGV…
☆31Updated 2 years ago
26hzhang / ReLoCLNet
Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)
☆58Updated 3 years ago
haojc / ShufflingVideosForTSG
Code for ECCV 2022 paper "Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding"
☆29Updated 2 years ago
foolwood / DRL
[arXiv22] Disentangled Representation Learning for Text-Video Retrieval
☆96Updated 3 years ago
r-cui / ViGA
"Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022
☆69Updated 3 years ago
DCDmllm / Momentor
☆76Updated 7 months ago
StanLei52 / GEBD
[ICCV2021] Generic Event Boundary Detection: A Benchmark for Event Segmentation
☆69Updated 3 years ago
SCZwangxiao / Temporal-Language-Grounding-in-videos
Temporal Moment(Action) Localization via Language / Temporal Language Grounding / Video Moment Retrieval
☆98Updated 3 years ago
tzhhhh123 / HC-STVG
The HC-STVG Dataset
☆56Updated 2 years ago
epic-kitchens / C5-Multi-Instance-Retrieval
☆11Updated 2 years ago
TencentARC / MCQ
Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).
☆139Updated 2 years ago
ioanacroi / qb-norm
Cross Modal Retrieval with Querybank Normalisation
☆55Updated last year
TencentARC / UMT
UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or …
☆222Updated last year
sangminwoo / Explore-And-Match
Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding …
☆42Updated 2 years ago
jayleicn / moment_detr
[NeurIPS 2021] Moment-DETR code and QVHighlights dataset
☆315Updated last year
wjun0830 / CGDETR
Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…
☆133Updated 10 months ago
26hzhang / VSLNet
Span-based Localizing Network for Natural Language Video Localization (ACL 2020)
☆107Updated 3 years ago
farewellthree / STAN
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
☆104Updated last year