JerryYLi/svitt

JerryYLi / svitt

Code for CVPR 2023 paper "SViTT: Temporal Learning of Sparse Video-Text Transformers"

☆21

Alternatives and similar repositories for svitt

Users that are interested in svitt are comparing it to the libraries listed below.

jayleicn / singularity
[ACL 2023] Official PyTorch code for Singularity model in "Revealing Single Frame Bias for Video-and-Language Learning"
☆136 • May 5, 2023 • Updated 3 years ago
wwfnb / Laser
☆16 • Sep 16, 2025 • Updated 10 months ago
ant-research / M2-Miner
[ICLR 2026] M2-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining
☆55 • Apr 22, 2026 • Updated 3 months ago
doc-doc / CoVGT
Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)
☆20 • Mar 9, 2024 • Updated 2 years ago
adxcreative / EERCF
Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning
☆21 • Feb 19, 2025 • Updated last year
ruotianluo / refexp-comprehension
Referring expression comprehension on ReferIt(RefClef)
☆10 • Nov 28, 2016 • Updated 9 years ago
davidpengucf / SFDAHPE
☆11 • Aug 17, 2023 • Updated 2 years ago
hee0624 / process_image
Generate noise images for use in the CV field
☆14 • Feb 9, 2021 • Updated 5 years ago
zs1314 / Fraesormer
【ICME2025 Oral】Official PyTorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"
☆13 • Mar 21, 2025 • Updated last year
hjy-u / ETOG
[ICRA 2025] A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping
☆13 • Feb 7, 2025 • Updated last year
ASU-APG / awesome_attribution_of_generative_models
☆10 • Oct 18, 2021 • Updated 4 years ago
dahyun-kang / cst
[CVPR'23] Official PyTorch implementation of Distilling Self-Supervised Vision Transformers for Weakly-Supervised Few-Shot Classification…
☆46 • Oct 22, 2023 • Updated 2 years ago
joelgrus / snowmeth
use AI to write a novel using the snowflake method
☆17 • Jul 23, 2025 • Updated last year
phseo / PAN
Progressive Attention Networks
☆12 • Oct 25, 2016 • Updated 9 years ago
LutingWang / HEAD
HEtero-Assists Distillation for Heterogeneous Object Detectors
☆10 • Jul 3, 2023 • Updated 3 years ago
NielsRogge / coco-eval
A tiny package supporting distributed computation of COCO metrics for PyTorch models.
☆15 • Feb 28, 2023 • Updated 3 years ago
Kroangine-Xia / Design-of-a-Gesture-Recognition-based-Robotic-Arm-Control-System
☆15 • Jun 19, 2024 • Updated 2 years ago
alon-albalak / online-data-mixing
An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.
☆14 • Jan 9, 2024 • Updated 2 years ago
YonghaoHe / DSLA
official code for Dynamic Smooth Label Assignment
☆12 • Oct 5, 2022 • Updated 3 years ago
csiro-icvg / Diff3DHPE
Diff3DHPE: A Diffusion Model for 3D Human Pose Estimation [R6D 2023] [Official]
☆15 • May 23, 2024 • Updated 2 years ago
xiaofanustc / ME-Sampler
[ECCV2020] Motion-excited Sampler: Video Adversarial Attack with Sparked Prior
☆11 • Nov 7, 2020 • Updated 5 years ago
uvadlc / uvadlc_practicals_2021
Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition
☆10 • Oct 31, 2022 • Updated 3 years ago
hs1ang-hsu / BLAPose
Official implementation of "Enhancing 3D Human Pose Estimation with Bone Length Adjustment"
☆19 • Jan 2, 2025 • Updated last year
Murf-y / Attractors-Simulation
Multiple Attractors simulation with customization
☆14 • Feb 22, 2026 • Updated 5 months ago
jpthu17 / EMCL
[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
☆148 • Apr 9, 2024 • Updated 2 years ago
IVRL / PoGaIN
(SP Letters 2022) PoGaIN: Poisson-Gaussian Image Noise Modeling from Paired Samples
☆16 • Feb 3, 2023 • Updated 3 years ago
SteveTsui / IDa-Det
☆12 • Apr 3, 2023 • Updated 3 years ago
jha-lab / LinGen
☆30 • Jun 9, 2025 • Updated last year
yl3800 / TranSTR
☆12 • Dec 15, 2023 • Updated 2 years ago
randxie / mmdetection-tvm
mmdetection -> TVM
☆15 • Aug 22, 2020 • Updated 5 years ago
qjy981010 / cocoapi
COCO API Customized for OVIS evaluation
☆17 • Nov 8, 2021 • Updated 4 years ago
moewiee / RSNA2020-Team-VinBDI-MedicalImaging
☆10 • Oct 27, 2020 • Updated 5 years ago
bastianwandt / ElePose
☆19 • Mar 28, 2022 • Updated 4 years ago
xuguohai / X-CLIP
An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"
☆185 • Apr 6, 2024 • Updated 2 years ago
pzs19 / LEMMA
☆16 • Sep 4, 2025 • Updated 10 months ago
Jorsorokin / ChaosAnalysis
A matlab package for analyzing chaotic properties of time series data
☆11 • Jun 29, 2018 • Updated 8 years ago
lixin666 / C2SNet
Contour Knowledge Transfer for Salient Object Detection
☆21 • Jun 15, 2019 • Updated 7 years ago
sauradip / fewshotQAT
[BMVC 2021] Official PyTorch implementation of "Few Shot Temporal Action Localization using Query Adaptive Transformers"
☆21 • Jul 12, 2022 • Updated 4 years ago
Lyun0912-wu / LongAttn
LongAttn: Selecting Long-context Training Data via Token-level Attention
☆15 • Jul 16, 2025 • Updated last year