[ECCV 2024 (Oral)] Towards Scene Graph Anticipation
☆19May 12, 2026Updated last week
Alternatives and similar repositories for SceneSayer
Users that are interested in SceneSayer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆28Jun 12, 2025Updated 11 months ago
- Official code for "Federated Weakly Supervised Video Anomaly Detection with Multimodal Prompt" (AAAI2025)☆27May 27, 2025Updated 11 months ago
- Open Set Video HOI detection from Action-centric Chain-of-Look Prompting, ICCV2023☆12Oct 3, 2023Updated 2 years ago
- ☆26Jun 5, 2025Updated 11 months ago
- ☆17Mar 10, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [AAAI2023] Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition (SloshNet)☆14Jan 10, 2024Updated 2 years ago
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆50Jan 8, 2025Updated last year
- CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation☆20Feb 2, 2024Updated 2 years ago
- Human-centric environment representations from egocentric video☆15Feb 5, 2026Updated 3 months ago
- A Holistic Embodied Cognition Benchmark☆19Apr 3, 2025Updated last year
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆11Feb 13, 2024Updated 2 years ago
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024☆11Nov 7, 2024Updated last year
- Official implementation of `Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning`, CVPR 2025☆12Aug 1, 2025Updated 9 months ago
- ☆14Oct 30, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICCV 2023] Official implementation of paper "SOAR: Scene-debiasing Open-set Action Recognition".☆12Dec 23, 2023Updated 2 years ago
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- [CVPR 2025] Official Pytorch implementation of "Learning with Noisy Triplet Correspondence for Composed Image Retrieval".☆24Jun 9, 2025Updated 11 months ago
- Official Pytorch Implementation of the framework TEMPURA proposed in our paper Unbiased Scene Graph Generation in Videos accepted by CVPR…☆25Sep 9, 2025Updated 8 months ago
- [2022 WACV] FastAno: Fast Anomaly Detection via Spatio-temporal Patch Transformation☆14Aug 7, 2023Updated 2 years ago
- A project using YoloV8 to detect License Plates☆13Sep 29, 2023Updated 2 years ago
- Awesome MLLMs/Benchmarks for Short/Long/Streaming Video Understanding☆66Sep 1, 2025Updated 8 months ago
- [CVPR 2024] Official Repository for MCPNet: An Interpretable Classifier via Multi-Level Concept Prototypes☆19Jul 8, 2024Updated last year
- [CVPR 2023] Code for the paper "Masked Images Are Counterfactual Samples for Robust Fine-tuning"☆14Mar 24, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- PyTorch implementation for "HyperSPNs: Compact and Expressive Probabilistic Circuits", NeurIPS 2021☆13Oct 26, 2021Updated 4 years ago
- ☆18Mar 9, 2023Updated 3 years ago
- [ECCV 2024 Best Paper Candidate] Implementation of "Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Vi…☆100Jul 27, 2025Updated 9 months ago
- Official Repository for the ICML 2023 paper "BiRT: Bio-inspired Replay in Vision Transformers for Continual Learning"☆16Oct 11, 2023Updated 2 years ago
- NeuSyRE: A Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment☆24Mar 10, 2024Updated 2 years ago
- [AAAI 2026 Oral] HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment☆30Dec 17, 2025Updated 5 months ago
- Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments (ECCV 2022)☆26Nov 12, 2024Updated last year
- ☆45Apr 14, 2023Updated 3 years ago
- A New Benchmark for Scene Graph Generation, targeting real-world applications☆148May 5, 2026Updated 2 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Codebase for AAAI 2024 conference paper Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning☆40Mar 12, 2025Updated last year
- ☆11Apr 19, 2022Updated 4 years ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- ☆19Aug 7, 2025Updated 9 months ago
- COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark☆15Aug 22, 2024Updated last year
- Spatial-Temporal Transformer for Dynamic Scene Graph Generation, ICCV2021☆213Aug 22, 2022Updated 3 years ago
- Pebble REBBLE watchface☆12Mar 3, 2025Updated last year