The implementation of Sequential VLAD in Pytorch
☆20Jun 20, 2019Updated 6 years ago
Alternatives and similar repositories for seqvlad-pytorch
Users that are interested in seqvlad-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"☆44Nov 19, 2019Updated 6 years ago
- Geometry Normalization Networks for Accurate Scene Text Detection (iccv 2019)☆21Apr 3, 2020Updated 6 years ago
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning☆60May 15, 2018Updated 8 years ago
- ☆16Dec 17, 2018Updated 7 years ago
- Video to Language Challenge (MSR-VTT Challenge 2016)☆32Dec 28, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ICCV2019: Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network☆68Nov 19, 2019Updated 6 years ago
- Code for ''A Simple Baseline for Audio-Visual Scene-Aware Dialog``☆27May 26, 2020Updated 6 years ago
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*☆15Apr 6, 2021Updated 5 years ago
- Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification☆13Feb 5, 2022Updated 4 years ago
- Code for the paper "Interpreting video features: A comparison of 3D Convolutional networks and Convolutional LSTM networks"☆11Dec 14, 2020Updated 5 years ago
- Pyramid Mask Text Detector designed by SenseTime Video Intelligence Research team.☆14Aug 1, 2019Updated 6 years ago
- Extract MFCCs from videos and make bag-of-audio-words (BOAW) representations.☆11Dec 20, 2018Updated 7 years ago
- Machine Translation Metrics Unit TesTing☆13Jun 4, 2016Updated 10 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- non local net based on caffe2☆11Nov 20, 2022Updated 3 years ago
- Source code for Delving Deeper into the Decoder for Video Captioning☆39Jun 1, 2021Updated 5 years ago
- Official Tensorflow Implementation of the paper "Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning" in CVPR 2…☆151Jul 8, 2019Updated 6 years ago
- video captioning☆24Mar 14, 2019Updated 7 years ago
- implement video caption based on openNMT☆36Apr 19, 2018Updated 8 years ago
- ☆12Feb 14, 2019Updated 7 years ago
- Caffe implementation of paper: "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering"☆29Oct 24, 2018Updated 7 years ago
- ☆33Apr 20, 2018Updated 8 years ago
- some models for video caption implemented by pytorch. (S2VT)☆23Feb 1, 2018Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The official implementation of "Low-power, Continuous Remote Behavioral Localization with Event Cameras" (CVPR 2024)☆12Sep 25, 2024Updated last year
- Pytorch code for our NeurIPS 2019 paper "Cross-channel Communication Networks"☆41Dec 13, 2019Updated 6 years ago
- Image Caption metrics: Bleu、Cider、Meteor、Rouge、Spice☆113Mar 2, 2019Updated 7 years ago
- A simple toolkit for processing event-based data.☆13Apr 7, 2026Updated 2 months ago
- The code of SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models☆23Mar 25, 2026Updated 2 months ago
- Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)☆11Jun 16, 2025Updated last year
- ☆21Jul 25, 2024Updated last year
- ☆12Nov 19, 2016Updated 9 years ago
- Community Regularization of Visually Grounded Dialog https://arxiv.org/abs/1808.04359☆15May 16, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Magnetic resonance imaging (MRI) images are known to be sparse. This is an implementation using non-convex penalty function that encourag…☆19Aug 10, 2019Updated 6 years ago
- An optimized version of SeqGAN in pytorch☆12Apr 24, 2018Updated 8 years ago
- Code release for Learning to Assemble Neural Module Tree Networks for Visual Grounding (ICCV 2019)☆39Nov 23, 2019Updated 6 years ago
- [MAC 2024] The baseline code for MAC 2024.☆12Jun 3, 2025Updated last year
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- pytorch implementation of video captioning☆401Aug 19, 2019Updated 6 years ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆10Apr 18, 2023Updated 3 years ago