mayhugotong / VideoINSTA
This is the official implementation of the EMNLP Findings paper, VideoINSTA: Zero-shot Long-Form Video Understanding via Informative Spatial-Temporal Reasoning
☆24Nov 15, 2024Updated last year
Alternatives and similar repositories for VideoINSTA
Users who are interested in VideoINSTA are comparing it to the repositories listed below
- Repository for the CVPR23 paper Re^2TAL☆13Nov 21, 2025Updated 2 months ago
- This is the official implementation repository of the NAACL Findings paper, GenTKG: Generative Forecasting on Temporal Knowledge Graph with Larg…☆61Oct 27, 2025Updated 3 months ago
- ☆22Jun 6, 2025Updated 8 months ago
- ☆22Mar 7, 2025Updated 11 months ago
- ☆20Apr 2, 2024Updated last year
- UniMD: Towards Unifying Moment retrieval and temporal action Detection☆55Jul 5, 2024Updated last year
- ☆26Aug 4, 2020Updated 5 years ago
- ☆18Jun 10, 2025Updated 8 months ago
- ☆39Jun 28, 2023Updated 2 years ago
- ☆13Aug 28, 2024Updated last year
- Linux configurations for themes, utilities, and layouts.☆11Oct 4, 2024Updated last year
- [ICCV 2021] MGSampler: An Explainable Sampling Strategy for Video Action Recognition☆51Jul 9, 2022Updated 3 years ago
- Code for MME-SID accepted to CIKM 2025 Full Research track.☆27Oct 29, 2025Updated 3 months ago
- A userscript to insert citations from Zotero into ShareLaTeX as you write☆11Feb 26, 2024Updated last year
- [IEEE TII 2025] Official Implementation for "Dual-Detector Reoptimization for Federated Weakly Supervised Video Anomaly Detection via Ada…☆26Nov 11, 2025Updated 3 months ago
- Official repository of "TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection"☆11May 25, 2025Updated 8 months ago
- Agentic Keyframe Search for Video Question Answering☆15Apr 7, 2025Updated 10 months ago
- ☆10Oct 7, 2023Updated 2 years ago
- quagga☆10Apr 7, 2020Updated 5 years ago
- An intelligent question-answering system built on langchain and chatglm6b, with support for custom corpora☆10Jun 25, 2023Updated 2 years ago
- ☆16Oct 9, 2024Updated last year
- [CHI24] AI-Assisted In-Context Writing on OHMD During Travels☆11Dec 19, 2024Updated last year
- ☆11Nov 21, 2022Updated 3 years ago
- fft/ifft transformations, DCT encoding/decoding using various techniques (zig-zag scanning, quantization), SNR calculations, filters (Gau…☆12Apr 30, 2018Updated 7 years ago
- The OBMO module embedded in PatchNet☆10Feb 21, 2024Updated last year
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆12Sep 17, 2023Updated 2 years ago
- ☆28Jul 18, 2025Updated 6 months ago
- Generates spectrogram from images☆13Apr 26, 2021Updated 4 years ago
- ☆47Apr 25, 2024Updated last year
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023☆11Sep 21, 2023Updated 2 years ago
- Federated Learning of Diffusion Models☆12Aug 30, 2023Updated 2 years ago
- ☆16May 12, 2025Updated 9 months ago
- Create your own custom MoViNet model using your custom data. You can train and run inference from "a single line of code".☆10Jun 1, 2023Updated 2 years ago
- Hides images into sound☆13Apr 26, 2021Updated 4 years ago
- ☆12Jan 17, 2024Updated 2 years ago
- ☆11Jan 8, 2025Updated last year
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- detailed notes for PointNet☆11Oct 23, 2020Updated 5 years ago
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring☆24Aug 8, 2025Updated 6 months ago