CoS: Chain-of-Shot Prompting for Long Video Understanding
☆53Feb 13, 2025Updated last year
Alternatives and similar repositories for CoS_codes
Users that are interested in CoS_codes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS'24] Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation☆63Dec 1, 2024Updated last year
- A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.☆581Mar 8, 2026Updated 2 weeks ago
- ☆12Jun 26, 2024Updated last year
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆21Aug 1, 2025Updated 7 months ago
- Learning Situation Hyper-Graphs for Video Question Answering☆22Feb 16, 2024Updated 2 years ago
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Feb 22, 2026Updated last month
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- [CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant☆27Dec 2, 2025Updated 3 months ago
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆35Feb 26, 2025Updated last year
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Apr 23, 2024Updated last year
- 小程序技术实现,攻克小程序技术。view和js 分离,参考vue的实现方式。主要技术栈:ts/express/android/ast/vnode☆29Nov 22, 2023Updated 2 years ago
- ☆19Jun 10, 2025Updated 9 months ago
- Pytorch Implementation of ECCV'22 paper: Video Activity Localisation with Uncertainties in Temporal Boundary☆17Jul 17, 2022Updated 3 years ago
- By converting single-channel grayscale images into multi-channel images through various data enhancement techniques, SimOTM enhances the …☆30May 26, 2025Updated 9 months ago
- simple web ui to manage mcp (model context protocol) servers in the claude app☆104May 16, 2025Updated 10 months ago
- Add a __source prop to all Elements.☆27Jul 17, 2024Updated last year
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning☆43Mar 2, 2026Updated 3 weeks ago
- (NeurIPS 2024) AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning☆238Jun 10, 2025Updated 9 months ago
- ☆46May 21, 2025Updated 10 months ago
- ☆72Oct 11, 2022Updated 3 years ago
- 新数据洞察方式☆1,005Jun 25, 2025Updated 8 months ago
- 简单易用的前端Unity框架☆22Aug 14, 2024Updated last year
- Chain-of-Frames [CVPR 2026]☆38Jul 2, 2025Updated 8 months ago
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆15Oct 27, 2024Updated last year
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…☆24Jan 26, 2025Updated last year
- ☆30Oct 13, 2022Updated 3 years ago
- 管理系统服务☆26Jan 9, 2026Updated 2 months ago
- nodeStack is a full-stack framework for JavaScript developers. It enables you to create high-performance, high-quality programs with mini…☆98Jan 23, 2026Updated 2 months ago
- Official Implementation (Pytorch) of "Super-class guided Transformer for Zero-Shot Attribute Classification", AAAI 2025☆15Jan 15, 2025Updated last year
- a demo but fun snake game created in https://aide.ink☆66Jan 15, 2025Updated last year
- GlucoInsight:Framework for Glucose Management Application☆84Aug 6, 2024Updated last year
- A light and general database connection pool tool☆24Sep 5, 2023Updated 2 years ago
- Introducing "ait," "aiself," and "aits"—new pronouns for AI systems. This repo provides definitions and examples to promote their use in …☆159May 21, 2024Updated last year
- ☆30Sep 2, 2023Updated 2 years ago
- Firmware for a 100W DC Electronic Load based on STM32F405 and LVGL (Keil MDK Project).☆498Jun 25, 2025Updated 8 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆24Sep 21, 2025Updated 6 months ago
- This is an official PyTorch Implementation of Neighbor Relations Matter in Video Scene Detection.☆28Mar 19, 2025Updated last year
- Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆23Nov 25, 2025Updated 3 months ago
- PhishIntention: Phishing detection through webpage intention☆257Jan 5, 2026Updated 2 months ago