The implementation of Sequential VLAD in Pytorch
☆20Jun 20, 2019Updated 6 years ago
Alternatives and similar repositories for seqvlad-pytorch
Users that are interested in seqvlad-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Geometry Normalization Networks for Accurate Scene Text Detection (iccv 2019)☆21Apr 3, 2020Updated 6 years ago
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning☆60May 15, 2018Updated 7 years ago
- Video to Language Challenge (MSR-VTT Challenge 2016)☆32Dec 28, 2017Updated 8 years ago
- ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network☆68Nov 19, 2019Updated 6 years ago
- Code for ''A Simple Baseline for Audio-Visual Scene-Aware Dialog``☆27May 26, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification☆13Feb 5, 2022Updated 4 years ago
- Code for the paper "Interpreting video features: A comparison of 3D Convolutional networks and Convolutional LSTM networks"☆11Dec 14, 2020Updated 5 years ago
- Pyramid Mask Text Detector designed by SenseTime Video Intelligence Research team.☆14Aug 1, 2019Updated 6 years ago
- A Pytorch implementation of "Reconstruction Network for Video Captioning", CVPR 2018☆53Apr 6, 2020Updated 6 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- VAE+GAN☆10Apr 18, 2018Updated 7 years ago
- Source code for Delving Deeper into the Decoder for Video Captioning☆39Jun 1, 2021Updated 4 years ago
- video captioning☆24Mar 14, 2019Updated 7 years ago
- implement video caption based on openNMT☆36Apr 19, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering☆10Nov 27, 2022Updated 3 years ago
- with reinforcement learning☆32May 19, 2020Updated 5 years ago
- ☆12Feb 14, 2019Updated 7 years ago
- Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"☆29Oct 24, 2018Updated 7 years ago
- ☆20Sep 19, 2019Updated 6 years ago
- Visual Coreference Resolution in Visual Dialog using Neural Module Networks☆57Oct 12, 2021Updated 4 years ago
- ☆33Apr 20, 2018Updated 7 years ago
- Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.…☆26Nov 3, 2018Updated 7 years ago
- some models for video caption implemented by pytorch. (S2VT)☆23Feb 1, 2018Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The official implementation of "Low-power, Continuous Remote Behavioral Localization with Event Cameras" (CVPR 2024)☆12Sep 25, 2024Updated last year
- Pytorch code for our NeurIPS 2019 paper "Cross-channel Communication Networks"☆41Dec 13, 2019Updated 6 years ago
- draw object rect and add some properties☆13May 28, 2018Updated 7 years ago
- Image Caption metrics: Bleu、Cider、Meteor、Rouge、Spice☆113Mar 2, 2019Updated 7 years ago
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Oct 9, 2025Updated 6 months ago
- The code of SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models☆22Mar 25, 2026Updated 3 weeks ago
- Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)☆11Jun 16, 2025Updated 10 months ago
- ☆21Jul 25, 2024Updated last year
- ☆12Nov 19, 2016Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Community Regularization of Visually Grounded Dialog https://arxiv.org/abs/1808.04359☆15May 16, 2019Updated 6 years ago
- An optimized version of SeqGAN in pytorch☆12Apr 24, 2018Updated 7 years ago
- Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)☆39Nov 23, 2019Updated 6 years ago
- [MAC 2024] The baseline code for MAC 2024.☆12Jun 3, 2025Updated 10 months ago
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- pytorch implementation of video captioning☆400Aug 19, 2019Updated 6 years ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆10Apr 18, 2023Updated 2 years ago