guoweiyu / Logic-in-Frames
Logic-in-frames: Dynamic keyframe search via visual semantic-logical verification for long video understanding
☆57Updated last month
Alternatives and similar repositories for Logic-in-Frames
Users that are interested in Logic-in-Frames are comparing it to the libraries listed below
- ☆224Updated 2 months ago
- [AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin…☆195Updated this week
- UR2: Unify RAG and Reasoning through Reinforcement Learning☆126Updated last month
- Gotta Hear Them All: Towards Sound Source Aware Audio Generation.☆67Updated last month
- The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b…☆47Updated 7 months ago
- Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"☆231Updated 2 months ago
- ☆55Updated last month
- [BIRD-INTERACT] Re-imagines Text-to-SQL evaluation via lens of dynamic interactions.☆455Updated 2 weeks ago
- INFTY Engine: An Optimization Toolkit to Support Continual AI☆566Updated 3 months ago
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness☆52Updated last year
- switch2ai - A JetBrains IDE plugin enabling seamless collaboration between JetBrains IDEs and various AI agents (Cursor, Qoder, Claude co…☆169Updated 2 months ago
- ☆104Updated 3 months ago
- Official repository of MMGenBench☆120Updated 10 months ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models☆469Updated 3 weeks ago
- A multi-agent debate framework supporting AI-vs-AI and Human-vs-AI modes with customizable models, personas, and role-specific prompts.☆64Updated last month
- A curated list of awesome papers, resources, and tools for Visual Prompt Tuning (VPT).☆106Updated 2 months ago
- [NeurIPS2024] MVGamba: Unify 3D Content Generation as State Space Sequence Modeling☆65Updated last year
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better☆185Updated 6 months ago
- On Predictability of Reinforcement Learning Dynamics for Large Language Models☆50Updated last month
- Beyond log-likelihood: exploring alternative objectives for supervised fine-tuning of language model post-training☆54Updated 3 months ago
- Official Pytorch implementation for ICML 2025 paper "Large Continual Instruction Assistant"☆66Updated 2 weeks ago
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆215Updated 2 months ago
- [USENIX Security'25] THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models☆108Updated 4 months ago
- An iOS network debug library. It can monitor HTTP requests within the app and display information related to each request.☆15Updated 8 years ago
- your finance bro Agent for trading and investing☆107Updated 2 months ago
- [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution☆356Updated last month
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection☆518Updated 6 months ago
- Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dep…☆574Updated 4 months ago
- Marco Search Agent for Realistic and Challenging Agentic Search☆240Updated 2 months ago
- Unified Semantic Curation Face (USCFace): An RDF Curation & Visualization System☆38Updated 5 months ago