SODA: Story Oriented Dense Video Captioning Evaluation Framework
☆14May 3, 2024Updated last year
Alternatives and similar repositories for SODA
Users that are interested in SODA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)☆230Jan 3, 2024Updated 2 years ago
- Natural Perturbation for Robust Question Answering☆12Apr 7, 2020Updated 5 years ago
- ☆13Apr 2, 2025Updated 11 months ago
- Dense video captioning in PyTorch☆41Aug 30, 2019Updated 6 years ago
- Implementation codes for NeurIPS23 paper "Spectral Invariant Learning for Dynamic Graphs under Distribution Shifts"☆14Mar 19, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Event Sequence Generation Network☆14Jun 22, 2021Updated 4 years ago
- an improvement of the paper: Learning to Detect Violent Videos using Convolution LSTM☆11Jun 1, 2020Updated 5 years ago
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)☆13Mar 8, 2024Updated 2 years ago
- https://avocado-captioner.github.io/☆31Oct 16, 2025Updated 5 months ago
- ☆12Feb 4, 2023Updated 3 years ago
- FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients☆14Jan 22, 2025Updated last year
- dmne is an algorithm to learn node representations from multi-network data.☆14Aug 1, 2018Updated 7 years ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆21Dec 22, 2025Updated 3 months ago
- ☆13Mar 21, 2019Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆18Apr 4, 2025Updated 11 months ago
- Video classification using convGRU☆13Feb 15, 2018Updated 8 years ago
- Image Caption workout with NIC and NBT☆15Apr 5, 2019Updated 6 years ago
- The implement of Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling☆12Aug 19, 2021Updated 4 years ago
- This repository contains 2 tools: - A py3 Lib for NLP & image-caption metrics - Code for a two-tailed t-test with paired samples. It wil…☆18Apr 4, 2021Updated 4 years ago
- A music commentary generator (OpenAI Scholar final project)☆14Oct 25, 2018Updated 7 years ago
- ☆15Apr 8, 2022Updated 3 years ago
- The Source Code for OmniVideoBench @ICLR 2026☆69Feb 12, 2026Updated last month
- Implementation for the project: Variational Image Captioning Using Deterministic Attention☆13Dec 14, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆16Sep 23, 2017Updated 8 years ago
- Official Implementation of "Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning"☆26Dec 16, 2025Updated 3 months ago
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago
- The HC-STVG Dataset☆63Apr 12, 2023Updated 2 years ago
- A curated collection of resources, papers, and methods related to Community Detection in complex networks☆23Dec 4, 2024Updated last year
- Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).☆202Jun 8, 2022Updated 3 years ago
- A set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classification☆29Jan 23, 2025Updated last year
- Generate chinese couplet with seq2seq & PyTorch☆21Jul 4, 2019Updated 6 years ago
- ☆20Mar 17, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Microsoft COCO Caption Evaluation Tool - Python 3☆33May 23, 2019Updated 6 years ago
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning☆43Mar 2, 2026Updated 3 weeks ago
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning☆91Apr 7, 2025Updated 11 months ago
- NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)☆30Jul 18, 2023Updated 2 years ago
- ☆29Apr 8, 2020Updated 5 years ago
- Official implementation of "An Action Is Worth Multiple Words: Handling Ambiguity in Action Recognition", BMVC 2022☆12Dec 16, 2022Updated 3 years ago
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆29Sep 12, 2024Updated last year