Yutong-Zhou-cv / Awesome-Survey-PapersLinks

A curated list of Survey Papers on Deep Learning.

☆12

Alternatives and similar repositories for Awesome-Survey-Papers

Users that are interested in Awesome-Survey-Papers are comparing it to the libraries listed below

Sorting:

GewelsJI / MVLT
Masked Vision-Language Transformer in Fashion
☆36Updated 2 years ago
JerryYLi / svitt
Code for CVPR 2023 paper "SViTT: Temporal Learning of Sparse Video-Text Transformers"
☆20Updated 2 years ago
jhcho99 / CoFormer
[CVPR'22] Official PyTorch Implementation of "Collaborative Transformers for Grounded Situation Recognition"
☆50Updated 2 years ago
jeykigung / HiCLIP
☆30Updated 2 years ago
allenai / grit_official
Official repository for the General Robust Image Task (GRIT) Benchmark
☆54Updated 2 years ago
umd-huang-lab / perceptionCLIP
Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts"
☆79Updated last year
facebookresearch / CiT
Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".
☆78Updated 2 years ago
weijiawu / Awesome-Synthetic-Data-for-Perception-Task
☆43Updated 2 years ago
shuheikurita / RefEgo
☆13Updated last year
OliverRensu / DeepMIM
[WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling
☆53Updated 5 months ago
Shahzadnit / EZ-CLIP
☆20Updated 5 months ago
callsys / TextVR
[PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension
☆28Updated last year
TencentARC / FLM
Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
☆32Updated 2 years ago
LightDXY / BootMAE
ECCV2022,Bootstrapped Masked Autoencoders for Vision BERT Pretraining
☆97Updated 2 years ago
facebookresearch / HierVL
[CVPR 2023] HierVL Learning Hierarchical Video-Language Embeddings
☆46Updated 2 years ago
Hritikbansal / videocon
☆57Updated last year
opendatalab / CLIP-Parrot-Bias
ECCV2024_Parrot Captions Teach CLIP to Spot Text
☆65Updated last year
facebookresearch / genecis
Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"
☆60Updated 2 years ago
Hxyou / MSCLIP
Official Code of ECCV 2022 paper MS-CLIP
☆90Updated 3 years ago
callsys / FlowText
[ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation
☆13Updated 2 years ago
iancovert / locality-alignment
☆53Updated 9 months ago
V3Det / Detectron2-V3Det
Detectron2 Toolbox and Benchmark for V3Det
☆18Updated last year
CASIA-IVA-Lab / Obj2Seq
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)
☆85Updated 2 years ago
renwang435 / video-ttt-release
Test-Time Training on Video Streams
☆64Updated 2 years ago
QUVA-Lab / PIN
Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
☆26Updated 9 months ago
showlab / sparseformer
(ICLR 2024, CVPR 2024) SparseFormer
☆75Updated 11 months ago
kirill-vish / Beyond-INet
Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"
☆101Updated last year
NVlabs / DICOD
Official Pytorch implementation for Distilling Image Classifiers in Object detection (NeurIPS2021)
☆32Updated 3 years ago
TencentARC / TVTS
Turning to Video for Transcript Sorting
☆48Updated 2 years ago
TencentARC / ViSFT
☆35Updated last year