guanxiongsun/vfe.pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/guanxiongsun/vfe.pytorch)

guanxiongsun / vfe.pytorch

Video Feature Enhancement with PyTorch

☆32

Alternatives and similar repositories for vfe.pytorch

Users that are interested in vfe.pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

L-KID / Video-object-detection-by-location-anticipation
View on GitHub
The official implementation of our ICCV 2023 paper "Objects do not disappear: Video object detection by single-frame object location anti…
☆33Oct 1, 2023Updated 2 years ago
Duckduckgod / MAMBA
View on GitHub
Self implementation of AAAI21 paper MAMBA for video object detection.
☆13May 3, 2022Updated 4 years ago
RUCAIBox / LMM-Searcher
View on GitHub
The official code of "Towards Long-horizon Agentic Multimodal Search"
☆27Apr 17, 2026Updated 3 months ago
siyuanliii / SLAck
View on GitHub
Official Implementation of ECCV2024 paper: SLAck
☆29Sep 18, 2024Updated last year
LiuYuML / NT-VOT211
View on GitHub
[ACCV 2024 (Oral, Best Application Paper)] Official Implementation of NT-VOT211: A Large-Scale Benchmark for Night-time Visual Object Tra…
☆16Dec 30, 2025Updated 6 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
micts / acgcn
View on GitHub
Code for the paper "Spot What Matters: Learning Context Using Graph Convolutional Networks for Weakly-Supervised Action Detection"
☆14Aug 18, 2021Updated 4 years ago
WHU-xjs / LSTFE-Net
View on GitHub
CVPR23-Video Small Object Detection with Long Short-Term Feature Enhancement Network
☆27May 21, 2024Updated 2 years ago
UESTC-nnLab / Tridos
View on GitHub
[TGRS 24] Triple-domain Feature Learning with Frequency-aware Memory Enhancement for Moving Infrared Small Target Detection
☆35May 30, 2025Updated last year
zjucsq / PLA
View on GitHub
[ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision
☆12Sep 17, 2023Updated 2 years ago
NJUDeepEngine / CAEF
View on GitHub
Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"
☆11Oct 11, 2024Updated last year
mengcaopku / DCNet
View on GitHub
[ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension
☆15Sep 4, 2022Updated 3 years ago
Fsoft-AIC / Z-GMOT
View on GitHub
[NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking
☆12May 19, 2026Updated 2 months ago
sdroh1027 / DiffusionVID
View on GitHub
Official Repository of the paper "DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object Detection"
☆50May 31, 2024Updated 2 years ago
SkyworkAI / DAQ-VS
View on GitHub
Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]
☆15Jul 11, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Hon-Wong / PTSEFormer
View on GitHub
[ECCV 2022] PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection
☆38Nov 3, 2022Updated 3 years ago
snap-research / MyVLM
View on GitHub
Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)
☆188Jul 5, 2024Updated 2 years ago
yuecao0119 / MMFuser
View on GitHub
The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …
☆63Nov 5, 2024Updated last year
Azong-HQU / MMTrack
View on GitHub
The official implementation for the paper [Towards Unified Token Learning for Vision-Language Tracking].
☆24Dec 13, 2023Updated 2 years ago
ycWang9725 / WSTAN
View on GitHub
☆16Dec 21, 2021Updated 4 years ago
Hon-Wong / ByteVideoLLM
View on GitHub
[ICCV 2025] Dynamic-VLM
☆28Dec 16, 2024Updated last year
ZhuHaoranEIS / Orthogonal-FGOD
View on GitHub
☆11Mar 4, 2026Updated 4 months ago
lntzm / MESM
View on GitHub
The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)
☆32Mar 29, 2024Updated 2 years ago
MCG-NJU / EVAD
View on GitHub
[ICCV 2023] Efficient Video Action Detection with Token Dropout and Context Refinement
☆39Sep 27, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SJTU-LuHe / TransVOD
View on GitHub
The repository is the code for the paper "End-to-End Video Object Detection with Spatial-TemporalTransformers"
☆248Oct 12, 2023Updated 2 years ago
shuheikurita / RefEgo
View on GitHub
☆13Jul 20, 2024Updated 2 years ago
lxa9867 / QSD
View on GitHub
[CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"
☆12Feb 27, 2024Updated 2 years ago
chenshihfang / GOT
View on GitHub
Can we make visual tracking systems align more closely with human visual perception?
☆44Jul 13, 2026Updated 2 weeks ago
skhcjh231 / MATR_codebase
View on GitHub
☆22Mar 7, 2025Updated last year
ZhuHaoranEIS / Point-Teacher
View on GitHub
Robust End-to-end Point-Supervised Tiny Object Detection
☆13Aug 12, 2025Updated 11 months ago
wuyi2020 / DoRM
View on GitHub
[NeurIPS 2023] Official pytorch implementation of "Domain Re-Modulation for Few-Shot Generative Domain Adaption"
☆13Aug 2, 2024Updated last year
Bomps4 / Multi_Resolution_Rescored_ByteTrack
View on GitHub
☆11Mar 30, 2026Updated 3 months ago
ussaema / Vector_Matrix_CapsuleGAN
View on GitHub
Implementation in the framework of my bachelor thesis: Generative Modelling using Capsule Generative Adversarial Networks
☆12Feb 20, 2026Updated 5 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Little-Podi / GRM
View on GitHub
[CVPR'23] The official PyTorch implementation of our CVPR 2023 paper: "Generalized Relation Modeling for Transformer Tracking".
☆88Dec 30, 2023Updated 2 years ago
zhaozeyang108 / Oriented-DETR
View on GitHub
The Project of ECCV 2024 Oral Paper "Oriented Object Detection vis Point-Axis Representation"
☆79Dec 12, 2024Updated last year
HengLan / CGSTVG
View on GitHub
[CVPR 2024] Context-Guided Spatio-Temporal Video Grounding
☆66Jun 28, 2024Updated 2 years ago
TRAILab / JDT3D
View on GitHub
Code for JDT3D (ECCV 2024 Paper)
☆17Sep 30, 2024Updated last year
Run542968 / GAP
View on GitHub
☆11Oct 13, 2024Updated last year
sankalpmittal1911 / Mask-RCNN-3D-Implementation
View on GitHub
This is the extension of Mask RCNN model to 3D images.
☆12May 7, 2019Updated 7 years ago
weijianan1 / NVI
View on GitHub
[ECCV2024] Nonverbal Interaction Detection
☆31Oct 30, 2024Updated last year