lwpyh/CoS_codes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lwpyh/CoS_codes)

lwpyh / CoS_codes

CoS: Chain-of-Shot Prompting for Long Video Understanding

☆53

Alternatives and similar repositories for CoS_codes

Users that are interested in CoS_codes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lwpyh / ProMaC_code
View on GitHub
[NeurIPS'24] Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation
☆65Dec 1, 2024Updated last year
lwpyh / Awesome-MLLM-Reasoning-Collection
View on GitHub
A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.
☆36Jul 1, 2026Updated 3 weeks ago
mlvlab / DeepVideoR1
View on GitHub
[NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"
☆36Feb 22, 2026Updated 5 months ago
V-STaR-Bench / V-STaR
View on GitHub
Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning
☆45Mar 2, 2026Updated 4 months ago
SunTongtongtong / Benchmark-Robustness-Text-Image-Compose-Retrieval
View on GitHub
☆13Apr 12, 2026Updated 3 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
sail-sg / Video-Next-Event-Prediction
View on GitHub
☆28Aug 9, 2025Updated 11 months ago
mlvlab / VidChain
View on GitHub
Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…
☆25Jan 26, 2025Updated last year
mayubo2333 / fewshot_ED
View on GitHub
ACL'2023: Few-shot Event Detection: An Empirical Study and a Unified View
☆11Mar 13, 2024Updated 2 years ago
marinero4972 / CyberV
View on GitHub
☆20Jun 10, 2025Updated last year
aurooj / SHG-VQA
View on GitHub
Learning Situation Hyper-Graphs for Video Question Answering
☆23Feb 16, 2024Updated 2 years ago
lntzm / MESM
View on GitHub
The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)
☆32Mar 29, 2024Updated 2 years ago
gyxxyg / VTG-LLM
View on GitHub
[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
☆130Dec 10, 2024Updated last year
Pilhyeon / BAM-DETR
View on GitHub
Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'
☆36Feb 26, 2025Updated last year
mlvlab / OVQA
View on GitHub
Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…
☆18Apr 23, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
fansunqi / VideoTool
View on GitHub
Official Repository for NeurIPS'25 Paper "Tool-Augmented Spatiotemporal Reasoning for Streamlining Video Question Answering Task"
☆23May 18, 2026Updated 2 months ago
Raymond-sci / EMB
View on GitHub
Pytorch Implementation of ECCV'22 paper: Video Activity Localisation with Uncertainties in Temporal Boundary
☆17Jul 17, 2022Updated 4 years ago
ncTimTang / AKS
View on GitHub
[CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding
☆228Dec 19, 2025Updated 7 months ago
64327069 / LVAgent
View on GitHub
Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
☆39Nov 24, 2025Updated 7 months ago
jongwoopark7978 / LVNet
View on GitHub
[Main Conference @ EACL'26] [Workshop @ NeurIPS'24] 🎞️ LVNet.
☆44Feb 10, 2026Updated 5 months ago
iLearn-Lab / CVPR25-LION-FS
View on GitHub
[CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant
☆29Dec 2, 2025Updated 7 months ago
fefergrgrgrg / smileyCoin
View on GitHub
simple web ui to manage mcp (model context protocol) servers in the claude app
☆103May 16, 2025Updated last year
zjxxxxxxxxx / unplugin-vue-source
View on GitHub
Add a __source prop to all Elements.
☆27Jul 17, 2024Updated 2 years ago
Hokhim2 / CVBench
View on GitHub
☆19Aug 28, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zengqunzhao / AIM-Fair
View on GitHub
[CVPR'25] AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data
☆17Mar 27, 2025Updated last year
WissingChen / CRA-GQA
View on GitHub
The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"
☆52Apr 27, 2025Updated last year
abyss219 / AI-Recycle-Helper
View on GitHub
☆71Oct 11, 2022Updated 3 years ago
zou-group / avatar
View on GitHub
(NeurIPS 2024) AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning
☆241Jun 10, 2025Updated last year
gitctrlx / xtrt
View on GitHub
A lightweight, high-performance deep learning inference tool.
☆51Dec 30, 2025Updated 6 months ago
VincentHancoder / AToM
View on GitHub
The official implementation of work "AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward".
☆19Mar 25, 2025Updated last year
QinMoXX / EffortlessFramework
View on GitHub
简单易用的前端Unity框架
☆22Aug 14, 2024Updated last year
chencjfeng / manage-system-server
View on GitHub
管理系统服务
☆26Jan 9, 2026Updated 6 months ago
aideink / game.snake
View on GitHub
a demo but fun snake game created in https://aide.ink
☆66Jan 15, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
zihuixue / ProgCaptioner
View on GitHub
Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)
☆26Jul 16, 2025Updated last year
YuCao16 / CRDI
View on GitHub
Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion
☆16Mar 14, 2025Updated last year
patrick-tssn / VSTAR
View on GitHub
[ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information
☆16Oct 27, 2024Updated last year
EvolvingLMMs-Lab / VideoMMMU
View on GitHub
Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos
☆72Sep 5, 2025Updated 10 months ago
tq1102 / tsdb
View on GitHub
☆30Oct 13, 2022Updated 3 years ago
chrisx599 / Video-Browser
View on GitHub
Official code repo of Video-Browser: Towards Agentic Open-web Video Browsing
☆28Jan 19, 2026Updated 6 months ago
www-Ye / Time-R1
View on GitHub
R1-like Video-LLM for Temporal Grounding
☆138Jun 20, 2025Updated last year