UCSC-VLAA/Image-Pretraining-for-Video

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UCSC-VLAA/Image-Pretraining-for-Video)

UCSC-VLAA / Image-Pretraining-for-Video

[ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recognition".

☆19

Alternatives and similar repositories for Image-Pretraining-for-Video

Users that are interested in Image-Pretraining-for-Video are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

UCSC-VLAA / RobustCNN
View on GitHub
[ICLR 2023] This repository includes the official implementation our paper "Can CNNs Be More Robust Than Transformers?"
☆144Jan 23, 2023Updated 3 years ago
UCSC-VLAA / AdvXL
View on GitHub
[CVPR 2024] This repository includes the official implementation our paper "Revisiting Adversarial Training at Scale"
☆20Apr 21, 2024Updated 2 years ago
yuyinzhou / L2B
View on GitHub
This repository includes the official project of L2B, from our paper "Learning to Bootstrap for Combating Label Noise".
☆32Mar 16, 2025Updated last year
aijinrjinr / MLB-Seg
View on GitHub
☆14Jul 2, 2024Updated 2 years ago
UCSC-VLAA / EVP
View on GitHub
[TMLR'24] This repository includes the official implementation our paper "Unleashing the Power of Visual Prompting At the Pixel Level"
☆42Apr 30, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
UCSC-VLAA / EarthWhere
View on GitHub
☆16Nov 15, 2025Updated 8 months ago
UCSC-VLAA / CRATE-alpha
View on GitHub
This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"
☆47Jun 3, 2024Updated 2 years ago
UCSC-VLAA / FedConv
View on GitHub
[TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…
☆25Apr 30, 2024Updated 2 years ago
haojinw0027 / MedFrameQA
View on GitHub
MedFrameQA: A Multi-Image Medical VQA Benchmark for Clinical Reasoning
☆18Jun 6, 2025Updated last year
OliverRensu / MVG
View on GitHub
☆61Jun 18, 2024Updated 2 years ago
OliverRensu / SDMP
View on GitHub
☆19Jan 2, 2023Updated 3 years ago
UCSC-VLAA / CLIPS
View on GitHub
An Enhanced CLIP Framework for Learning with Synthetic Captions
☆40Apr 18, 2025Updated last year
UCSC-VLAA / MedVLSynther
View on GitHub
[ICLR'26] MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs
☆19Nov 1, 2025Updated 8 months ago
bairdzhang / des
View on GitHub
☆19Mar 27, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
UCSC-VLAA / Recap-DataComp-1B
View on GitHub
[ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"
☆152Jun 13, 2024Updated 2 years ago
UCSC-VLAA / DMAE
View on GitHub
[CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"
☆109Jul 24, 2023Updated 3 years ago
MikeWangWZHL / Zemi
View on GitHub
Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings
☆15May 3, 2023Updated 3 years ago
Sha-Lab / CMHSE
View on GitHub
The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
☆16Apr 22, 2019Updated 7 years ago
wangf3014 / Adventurer
View on GitHub
☆29Feb 27, 2025Updated last year
mirthAI / MicroSegNet
View on GitHub
☆20Jan 23, 2024Updated 2 years ago
UCSC-VLAA / EpiFoundation
View on GitHub
Pytorch implementation of EpiFoundation
☆26Feb 25, 2025Updated last year
UCSC-VLAA / AttnGCG-attack
View on GitHub
[TMLR 2025] Official implementation of AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation
☆27Jun 17, 2025Updated last year
UCSC-VLAA / ClinSeekAgent
View on GitHub
☆30Jun 1, 2026Updated last month
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
UCSC-VLAA / ReasoningEval
View on GitHub
Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.
☆43Jun 6, 2025Updated last year
UCSC-VLAA / CLIPA
View on GitHub
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
☆322Jun 3, 2024Updated 2 years ago
UCSC-VLAA / MeDiM
View on GitHub
☆32Dec 1, 2025Updated 7 months ago
Pillercottrer / radcap_project
View on GitHub
☆19Mar 19, 2019Updated 7 years ago
UCSC-VLAA / o1_medical
View on GitHub
☆48Feb 26, 2025Updated last year
UCSC-VLAA / STAR-1
View on GitHub
[AAAI'26 Oral] Official Implementation of STAR-1: Safer Alignment of Reasoning LLMs with 1K Data
☆38Apr 7, 2025Updated last year
mesunhlf / UPC-tf
View on GitHub
☆45May 8, 2020Updated 6 years ago
UCSC-VLAA / m1
View on GitHub
[ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models
☆51Dec 21, 2025Updated 7 months ago
UCSC-VLAA / MicroDiffusion
View on GitHub
[CVPR 2024] This repository includes the official implementation our paper "MicroDiffusion: Implicit Representation-Guided Diffusion for …
☆55May 13, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
vyom-os / SAM2-Implementation-ReactNative
View on GitHub
Implementing ONNX runtime for android to run Segment Anything Model 2
☆12Aug 1, 2025Updated 11 months ago
ytongbai / ViTs-vs-CNNs
View on GitHub
[NeurIPS 2021]: Are Transformers More Robust Than CNNs? (Pytorch implementation & checkpoints)
☆180Dec 9, 2021Updated 4 years ago
singlaayush / MINIT
View on GitHub
This repository houses the official implementation of Multiple Instance NeuroImage Transformer (MINiT) paper, accepted at PRedictive Inte…
☆16Aug 23, 2022Updated 3 years ago
UCSC-VLAA / MedVLThinker
View on GitHub
[ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning
☆60Dec 21, 2025Updated 7 months ago
OliverRensu / DeepMIM
View on GitHub
[WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling
☆56May 10, 2025Updated last year
pimed / RAPHIA
View on GitHub
☆14Apr 15, 2024Updated 2 years ago
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago