[ECCV 2024] Official PyTorch implementation of "Classification Matters: Improving Video Action Detection with Class-Specific Attention"
☆17Nov 8, 2024Updated last year
Alternatives and similar repositories for class-query-vad
Users that are interested in class-query-vad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SSL Video Representation Learning project☆14Jul 8, 2025Updated 8 months ago
- Implementation of Learning Instance-Aware Object Detection Using Determinantal Point Processes [https://arxiv.org/pdf/1805.10765.pdf]☆19Nov 21, 2023Updated 2 years ago
- CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency☆18Aug 10, 2022Updated 3 years ago
- nnq_cnd_study stands for Neural Network Quantization & Compact Networks Design Study☆13Aug 31, 2020Updated 5 years ago
- ☆20Aug 18, 2020Updated 5 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Deep learning tutorials using tensorflow☆22Oct 11, 2019Updated 6 years ago
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆35Feb 26, 2025Updated last year
- [ICCV 2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs☆58Feb 2, 2026Updated last month
- Model predictive control under STL constraints☆33Nov 24, 2025Updated 4 months ago
- ☆30Nov 24, 2025Updated 4 months ago
- ☆33Nov 24, 2025Updated 4 months ago
- ☆30Sep 3, 2019Updated 6 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Implementation of Deep Elastic Network☆42Nov 24, 2025Updated 4 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official Implementation of Video-MA2MBA☆12Dec 3, 2024Updated last year
- [CVPR2025] Official code for Lost in Translation Found in Context☆23Jan 14, 2026Updated 2 months ago
- Implementation of Unsupervised 3D Reconstruction Network☆47Nov 24, 2025Updated 4 months ago
- Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T…☆12Mar 9, 2024Updated 2 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- Visual Speech Recognition For Low-Resource Languages with Automatic Labels (ICASSP 2024)☆16Mar 17, 2025Updated last year
- ☆14Apr 25, 2025Updated 11 months ago
- Large-Vocabulary Continuous Sign Language Recognition, 2024☆15May 30, 2024Updated last year
- Implementation of Tsallis Actor Critic method☆61Nov 24, 2025Updated 4 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Code for "Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations" (CVPR 2024 Oral)☆18Jun 23, 2024Updated last year
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆31Feb 28, 2026Updated last month
- FastMIM, official pytorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision"(https://arxiv.o…☆39Dec 29, 2022Updated 3 years ago
- ☆31Apr 21, 2025Updated 11 months ago
- Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024)☆20Mar 17, 2025Updated last year
- Java web application backed by the Ethereum-Blockchain network. Powered by RESTful web services (JAX-RS && Spring Boot) , Docker, Kuberne…☆14Feb 19, 2019Updated 7 years ago
- ☆11Oct 13, 2024Updated last year
- PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scorin…☆21Apr 3, 2024Updated last year
- Survey-on-Implicit-Neural-Representation☆36Mar 31, 2021Updated 4 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- [CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆17Oct 4, 2025Updated 5 months ago
- Official Implementation of CODE☆17Sep 26, 2024Updated last year
- Official PyTorch Implementation for the "What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-mod…☆20Sep 26, 2024Updated last year
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆16Mar 18, 2026Updated last week
- Official Pytorch implementation for AAAI2021 paper (RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning)☆37Nov 5, 2021Updated 4 years ago
- TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages☆18May 23, 2024Updated last year
- ☆11Aug 7, 2024Updated last year