☆83May 6, 2025Updated 9 months ago
Alternatives and similar repositories for neptune
Users that are interested in neptune are comparing it to the libraries listed below
Sorting:
- ☆20Nov 28, 2024Updated last year
- ☆12Nov 13, 2024Updated last year
- SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024)☆18Apr 28, 2024Updated last year
- Chain-of-Frames [CVPR 2026]☆38Jul 2, 2025Updated 8 months ago
- Official PyTorch code of GroundVQA (CVPR'24)☆64Sep 13, 2024Updated last year
- splits videos into scenes with gpt-4o-mini and saves them separately☆12Dec 19, 2024Updated last year
- ☆26Apr 26, 2025Updated 10 months ago
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated last month
- Repository for the CVPR23 paper Re^2TAL☆13Nov 21, 2025Updated 3 months ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆21Jul 16, 2025Updated 7 months ago
- Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)☆34Oct 16, 2024Updated last year
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …☆28Nov 1, 2025Updated 4 months ago
- Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining, WACV 2024☆14Jan 3, 2024Updated 2 years ago
- ☆17Oct 22, 2024Updated last year
- ☆17Jun 20, 2025Updated 8 months ago
- NestJS project template, configured with prisma and ejs☆12Dec 1, 2024Updated last year
- ☆21Jul 9, 2025Updated 7 months ago
- This script automates the process of unlocking Apple ID accounts by solving captcha challenges, verifying account details, and resetting …☆13Jan 24, 2026Updated last month
- Progetto per la prova finale di Ingegneria del Software 2023-2024 al Politecnico di Milano☆10Oct 19, 2024Updated last year
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".☆44Sep 12, 2024Updated last year
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆40May 30, 2025Updated 9 months ago
- Quick Long Video Understanding [TMLR2025]☆76Oct 27, 2025Updated 4 months ago
- WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning (CVPR 2026)☆55Dec 30, 2025Updated 2 months ago
- I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024)☆22Oct 30, 2024Updated last year
- [NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and R…☆31Sep 27, 2025Updated 5 months ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆22Dec 4, 2024Updated last year
- Code to generate datasets used in "How Useful is Self-Supervised Pretraining for Visual Tasks?"☆22Apr 13, 2020Updated 5 years ago
- Code for CVPR25 paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"☆154Jun 23, 2025Updated 8 months ago
- This repository contains the implementation of the paper: "ChatCam: Empowering Camera Control through Conversational AI", NeurIPS 2024.☆21Nov 15, 2024Updated last year
- ☆14Sep 10, 2024Updated last year
- API de mapeo para la Universidad de El Salvador (UES), desarrollada por estudiantes de la Facultad Multidisciplinaria Oriental. Proporcio…☆16Oct 3, 2025Updated 4 months ago
- ☆26Feb 20, 2025Updated last year
- Official repository for "IntentQA: Context-aware Video Intent Reasoning" from ICCV 2023.☆23Nov 29, 2024Updated last year
- CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment☆22Apr 15, 2022Updated 3 years ago
- [ICLR 2026] "VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?", Yuanxin Liu, Kun Ouyang, Haoning Wu, Yi Liu, L…☆37Jan 30, 2026Updated last month
- Learning Situation Hyper-Graphs for Video Question Answering☆22Feb 16, 2024Updated 2 years ago
- The official code for Devil's on the Edges: Selective Quad Attention for Scene Graph Generation, CVPR2023.☆25Jul 17, 2023Updated 2 years ago
- Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?☆87Jul 13, 2025Updated 7 months ago
- ☆27Mar 21, 2024Updated last year