The implementation of Sequential VLAD in Pytorch
☆20Jun 20, 2019Updated 6 years ago
Alternatives and similar repositories for seqvlad-pytorch
Users that are interested in seqvlad-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"☆44Nov 19, 2019Updated 6 years ago
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning☆60May 15, 2018Updated 7 years ago
- ☆16Dec 17, 2018Updated 7 years ago
- Video to Language Challenge (MSR-VTT Challenge 2016)☆32Dec 28, 2017Updated 8 years ago
- ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network☆68Nov 19, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for ''A Simple Baseline for Audio-Visual Scene-Aware Dialog``☆27May 26, 2020Updated 5 years ago
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*☆15Apr 6, 2021Updated 4 years ago
- Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification☆13Feb 5, 2022Updated 4 years ago
- Code for the paper "Interpreting video features: A comparison of 3D Convolutional networks and Convolutional LSTM networks"☆11Dec 14, 2020Updated 5 years ago
- Pyramid Mask Text Detector designed by SenseTime Video Intelligence Research team.☆14Aug 1, 2019Updated 6 years ago
- A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018☆53Apr 6, 2020Updated 5 years ago
- non local net based on caffe2☆11Nov 20, 2022Updated 3 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- VAE+GAN☆10Apr 18, 2018Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Source code for Delving Deeper into the Decoder for Video Captioning☆39Jun 1, 2021Updated 4 years ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆152Jul 8, 2019Updated 6 years ago
- video captioning☆24Mar 14, 2019Updated 7 years ago
- implement video caption based on openNMT☆36Apr 19, 2018Updated 7 years ago
- RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering☆10Nov 27, 2022Updated 3 years ago
- with reinforcement learning☆32May 19, 2020Updated 5 years ago
- ☆12Feb 14, 2019Updated 7 years ago
- Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"☆29Oct 24, 2018Updated 7 years ago
- ☆20Sep 19, 2019Updated 6 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Visual Coreference Resolution in Visual Dialog using Neural Module Networks☆57Oct 12, 2021Updated 4 years ago
- ☆33Apr 20, 2018Updated 7 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- some models for video caption implemented by pytorch. (S2VT)☆23Feb 1, 2018Updated 8 years ago
- The official implementation of "Low-power, Continuous Remote Behavioral Localization with Event Cameras" (CVPR 2024)☆12Sep 25, 2024Updated last year
- 仿喜马拉雅录音剪切的view☆15Oct 28, 2025Updated 4 months ago
- draw object rect and add some properties☆13May 28, 2018Updated 7 years ago
- Image Caption metrics: Bleu、Cider、Meteor、Rouge、Spice☆113Mar 2, 2019Updated 7 years ago
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Oct 9, 2025Updated 5 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The code of SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models☆22Apr 16, 2025Updated 11 months ago
- Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)☆11Jun 16, 2025Updated 9 months ago
- ☆21Jul 25, 2024Updated last year
- ☆12Nov 19, 2016Updated 9 years ago
- tensor-train tensor completion (T3C), which is based on tt decomposition and gradient descent.☆12Jun 27, 2018Updated 7 years ago
- Community Regularization of Visually Grounded Dialog https://arxiv.org/abs/1808.04359☆15May 16, 2019Updated 6 years ago
- Magnetic resonance imaging (MRI) images are known to be sparse. This is an implementation using non-convex penalty function that encourag…☆20Aug 10, 2019Updated 6 years ago