jshi31/NAFAE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jshi31/NAFAE)

jshi31 / NAFAE

Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Losses"

☆30

Alternatives and similar repositories for NAFAE

Users that are interested in NAFAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MichiganCOG / Video-Grounding-from-Text
View on GitHub
Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"
☆47Jun 22, 2024Updated 2 years ago
Sy-Zhang / TCMN-Release
View on GitHub
Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"
☆16Oct 22, 2022Updated 3 years ago
hyounghk / VideoQADenseCapFrameGate-ACL2020
View on GitHub
Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…
☆34May 14, 2020Updated 6 years ago
INK-USC / VisCOLL
View on GitHub
Code and data for the project "Visually grounded continual learning of compositional semantics"
☆22Dec 27, 2022Updated 3 years ago
zfchenUnique / WSSTG
View on GitHub
This repository contains the main baselines introduced in WSSTG (ACL 2019).
☆57Jul 8, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
iworldtong / TALL.pytorch
View on GitHub
PyTorch implementation of "TALL: Temporal Activity Localization via Language Query. Gao et al. ICCV2017."
☆14Apr 20, 2019Updated 7 years ago
zfchenUnique / VID-Sentence
View on GitHub
This repository provides the dataset introduced by our WSSTG paper
☆13Jul 21, 2019Updated 7 years ago
TheShadow29 / vognet-pytorch
View on GitHub
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
☆69Jun 10, 2020Updated 6 years ago
JaywongWang / CBP
View on GitHub
Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware P…
☆59Mar 24, 2023Updated 3 years ago
doc-doc / vRGV
View on GitHub
Visual Relation Grounding in Videos (ECCV'20, Spotlight)
☆57Dec 8, 2022Updated 3 years ago
niluthpol / weak_supervised_video_moment
View on GitHub
Weakly Supervised Video Moment Retrieval from Text Queries
☆43Jul 20, 2020Updated 6 years ago
BonnieHuangxin / SLTA
View on GitHub
ACM ICMR 2019《Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal Attention》
☆36Jun 19, 2019Updated 7 years ago
liudaizong / CSMGAN
View on GitHub
Code for ACM MM2020 paper: Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization
☆34Sep 3, 2020Updated 5 years ago
ych133 / How2R-and-How2QA
View on GitHub
A video retrieval dataset How2R and a video QA dataset How2QA
☆24Oct 15, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
escorciav / moments-retrieval-page
View on GitHub
Moments Retrieval Project Webpage (temporal)
☆31Jan 17, 2024Updated 2 years ago
tzhhhh123 / HC-STVG
View on GitHub
The HC-STVG Dataset
☆65Apr 12, 2023Updated 3 years ago
yytzsy / SMCG
View on GitHub
Code for the paper "Controllable Video Captioning with an Exemplar Sentence"
☆12Apr 14, 2021Updated 5 years ago
iworldtong / Awesome-Temporal-Sentence-Grounding-in-Videos
View on GitHub
A curated list of grounding natural language in video and related area. :-)
☆82Dec 16, 2019Updated 6 years ago
youngfly11 / LCMCG-PyTorch
View on GitHub
AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"
☆58Oct 25, 2021Updated 4 years ago
ikuinen / semantic_completion_network
View on GitHub
☆26Aug 4, 2020Updated 5 years ago
crodriguezo / TMLGA
View on GitHub
Repository of proposal-free temporal moment localization work
☆33Jun 11, 2024Updated 2 years ago
jacobswan1 / MTG-pytorch
View on GitHub
Gender/Age attribute grounding using weak supervised manner.
☆12Jun 23, 2019Updated 7 years ago
zinengtang / DeCEMBERT
View on GitHub
Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021)
☆17Jan 12, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
qinzzz / Multimodal-Alignment-Framework
View on GitHub
Implementation for MAF: Multimodal Alignment Framework
☆45Nov 25, 2020Updated 5 years ago
yytzsy / grounding_changing_distribution
View on GitHub
☆36Apr 14, 2021Updated 5 years ago
yytzsy / SCDM
View on GitHub
Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos
☆71Sep 7, 2021Updated 4 years ago
antoine77340 / S3D_HowTo100M
View on GitHub
S3D Text-Video model trained on HowTo100M using MIL-NCE
☆200Jul 3, 2020Updated 6 years ago
madhawav / MML
View on GitHub
Multi-faceted Video Moment Localizer
☆17Jun 19, 2020Updated 6 years ago
zbwglory / CMHSE
View on GitHub
The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch
☆20Apr 26, 2020Updated 6 years ago
yaohungt / Gated-Spatio-Temporal-Energy-Graph
View on GitHub
[CVPR'19] [PyTorch] Gated Spatio Temporal Energy Graph
☆153Feb 20, 2020Updated 6 years ago
ikuinen / CMIN_moment_retrieval
View on GitHub
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos
☆87Nov 22, 2020Updated 5 years ago
dazhang-cv / MAN
View on GitHub
This is the official repo for "MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment"
☆17May 27, 2019Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jayleicn / TVCaption
View on GitHub
[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
☆91Sep 6, 2023Updated 2 years ago
JaywongWang / TGN
View on GitHub
Tensorflow Reproduction of the EMNLP-2018 paper "Temporally Grounding Natural Sentence in Video"
☆17Nov 21, 2022Updated 3 years ago
XgDuan / WSDEC
View on GitHub
Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised man…
☆104Mar 21, 2020Updated 6 years ago
jshi31 / awesome-language-guided-image-editing
View on GitHub
A list of papers and other resources on language-guided image editing.
☆39Jan 13, 2021Updated 5 years ago
BigRedT / info-ground
View on GitHub
Learning phrase grounding from captioned images through InfoNCE bound on mutual information
☆73Aug 22, 2020Updated 5 years ago
yytzsy / ABLR_code
View on GitHub
The source code of the paper: "To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression"
☆30Jan 8, 2019Updated 7 years ago
erobic / negative_analysis_of_grounding
View on GitHub
Shows visual grounding methods can be right for the wrong reasons! (ACL 2020)
☆23Jun 26, 2020Updated 6 years ago