CoS: Chain-of-Shot Prompting for Long Video Understanding
☆53Feb 13, 2025Updated last year
Alternatives and similar repositories for CoS_codes
Users that are interested in CoS_codes are comparing it to the libraries listed below
Sorting:
- [NeurIPS'24] Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation☆62Dec 1, 2024Updated last year
- A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.☆570Feb 22, 2026Updated last week
- ☆18Jun 10, 2025Updated 8 months ago
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Feb 22, 2026Updated last week
- [CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant☆27Dec 2, 2025Updated 3 months ago
- ☆12Jun 26, 2024Updated last year
- 小程序技术实现,攻克小程序技术。view和js 分离,参考vue的实现方式。主要技术栈:ts/express/android/ast/vnode☆29Nov 22, 2023Updated 2 years ago
- Learning Situation Hyper-Graphs for Video Question Answering☆22Feb 16, 2024Updated 2 years ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 5 months ago
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 10 months ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆20Aug 1, 2025Updated 7 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆17Apr 2, 2025Updated 11 months ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆17May 15, 2025Updated 9 months ago
- LMM for VQA, tcsvt version☆11Jul 19, 2024Updated last year
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆22Jul 1, 2025Updated 8 months ago
- By converting single-channel grayscale images into multi-channel images through various data enhancement techniques, SimOTM enhances the …☆30May 26, 2025Updated 9 months ago
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆35Feb 26, 2025Updated last year
- ☆12Jan 10, 2025Updated last year
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆15Oct 27, 2024Updated last year
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆22Feb 13, 2025Updated last year
- ☆46May 21, 2025Updated 9 months ago
- 简单易用的前端Unity框架☆22Aug 14, 2024Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆49Updated this week
- [ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache☆43Jul 26, 2024Updated last year
- (NeurIPS 2024) AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning☆238Jun 10, 2025Updated 8 months ago
- ☆30Oct 13, 2022Updated 3 years ago
- simple web ui to manage mcp (model context protocol) servers in the claude app☆103May 16, 2025Updated 9 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆21Dec 22, 2025Updated 2 months ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆21Oct 8, 2024Updated last year
- ☆19Jun 29, 2025Updated 8 months ago
- Add a __source prop to all Elements.☆27Jul 17, 2024Updated last year
- a demo but fun snake game created in https://aide.ink☆66Jan 15, 2025Updated last year
- ☆72Oct 11, 2022Updated 3 years ago
- 管理系统服务☆26Jan 9, 2026Updated last month
- Control LLM☆22Apr 6, 2025Updated 10 months ago
- A light and general database connection pool tool☆24Sep 5, 2023Updated 2 years ago
- A Chatbot with UI design is created, according to some certain datasets (can be replaced). Through statistical analysis and PINN model, i…☆27May 28, 2025Updated 9 months ago
- Pytorch Implementation of ECCV'22 paper: Video Activity Localisation with Uncertainties in Temporal Boundary☆17Jul 17, 2022Updated 3 years ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆23Jun 26, 2025Updated 8 months ago