Video captioning on MSR-VTT Dataset
☆12Mar 21, 2021Updated 4 years ago
Alternatives and similar repositories for Video_Captioning_Pytorch
Users that are interested in Video_Captioning_Pytorch are comparing it to the libraries listed below
Sorting:
- SW components and demos for visual kinship recognition. An emphasis is put on the FIW dataset-- data loaders, benchmarks, results in summ…☆17Mar 13, 2023Updated 2 years ago
- A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.☆74Jul 30, 2023Updated 2 years ago
- [ECCV 2022] "Adversarial Contrastive Learning via Asymmetric InfoNCE"☆24Dec 12, 2022Updated 3 years ago
- Video Captioning on MSR-VTT and MSVD dataset using Deep Learning☆21Aug 14, 2020Updated 5 years ago
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated 11 months ago
- ☆33Apr 20, 2018Updated 7 years ago
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆16Mar 23, 2025Updated 11 months ago
- DiG-IN: Diffusion Guidance for Investigating Networks - Uncovering Classifier Differences, Neuron Visualisations, and Visual Counterfactu…☆10Oct 9, 2024Updated last year
- Project Page for CoPRS, offering training overview, inference code, and downloadable links.☆20Oct 27, 2025Updated 4 months ago
- ☆11Apr 6, 2019Updated 6 years ago
- PyTorch Implementation of the paper "Defining and Quantifying the Emergence of Sparse Concepts in DNNs" (CVPR 2023)☆12Dec 24, 2023Updated 2 years ago
- ☆10Sep 7, 2022Updated 3 years ago
- Decorrelate Irrelevant, Purify Relevant: Overcome Textual Spurious Correlations from a Feature Perspective☆11Nov 16, 2022Updated 3 years ago
- Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets☆12May 25, 2023Updated 2 years ago
- The implemented code of RAMEM, Real-time Automatic M-mode Echocardiography Measurement with Panel Attention from Local-to-Global Pixels.☆14Aug 16, 2023Updated 2 years ago
- The code for the paper "Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning" (CVPR'25).☆14Sep 25, 2025Updated 5 months ago
- Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-me…☆12Dec 25, 2025Updated 2 months ago
- Generating Human Skeletons with Mutual Actions☆11Oct 22, 2021Updated 4 years ago
- A simply deep learning based blur image detector.☆10Mar 29, 2023Updated 2 years ago
- ☆11Jul 19, 2022Updated 3 years ago
- ☆12Sep 8, 2022Updated 3 years ago
- Fairness-Aware Representation Learning by Suppressing Attribute-Class Associations☆12Dec 10, 2024Updated last year
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models☆10Feb 20, 2025Updated last year
- [CVPR'25] AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data☆17Mar 27, 2025Updated 11 months ago
- ☆17Updated this week
- [CVPR 2023] Backdoor Defense via Adaptively Splitting Poisoned Dataset☆49Apr 8, 2024Updated last year
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 5 months ago
- ☆10Jan 29, 2019Updated 7 years ago
- ☆10Jul 22, 2021Updated 4 years ago
- A TensorFlow Implementation of GraLSP: Graph Neural Networks with Local Structural Patterns, In AAAI, 2020.☆12Jun 25, 2020Updated 5 years ago
- ☆11Sep 11, 2023Updated 2 years ago
- Code used in the paper "Learning to Learn from Web Data through Deep Semantic Embeddings" ECCV 2018 MULA Workshop☆11Aug 1, 2018Updated 7 years ago
- One-Pixel Shortcut: on the Learning Preference of Deep Neural Networks (ICLR 2023 Spotlight)☆14Sep 28, 2025Updated 5 months ago
- ☆12Oct 2, 2023Updated 2 years ago
- ☆11Aug 7, 2024Updated last year
- An end-to-end framework for pulmonary airway analysis☆19Jan 19, 2026Updated last month
- Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton based Action Recognition☆11Aug 30, 2021Updated 4 years ago
- Comprehensive Information Integration Modeling Framework for Video Titling☆11Aug 27, 2020Updated 5 years ago
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…☆12May 15, 2024Updated last year