HKUDS / VideoAgentLinks
"VideoAgent: All-in-One Agentic Framework for Video Understanding, Editing, and Remaking"
☆317Updated 2 months ago
Alternatives and similar repositories for VideoAgent
Users that are interested in VideoAgent are comparing it to the libraries listed below
Sorting:
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"☆556Updated 4 months ago
- [EMNLP2025] "GraphAgent: Agentic Graph Language Assistant"☆329Updated 10 months ago
- Babel - Open Multilingual Large Language Models Serving Over 90% of Global Speakers☆212Updated 9 months ago
- Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"☆215Updated 4 months ago
- LightAgent: Lightweight AI agent framework with memory, tools & tree-of-thought. Supports multi-agent collaboration, self-learning, and m…☆436Updated last month
- [arXiv'25] EraRAG: Efficient and Incremental Retrieval-Augmented Generation for Growing Corpora☆161Updated 2 months ago
- [ACL 2025 Findings] MegaAgent: A Large-Scale Autonomous LLM-based Multi-Agent System Without Predefined SOPs https://aclanthology.org/202…☆233Updated last month
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML☆59Updated last week
- [NeurIPS 2025] A Graph-based LLM Framework for Real-world SE Tasks☆496Updated 3 months ago
- Tree Search for LLM Agent Reinforcement Learning☆252Updated 2 months ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414☆471Updated 2 months ago
- Source code and utilities for the Genesys distributed language model architecture discovery system.☆160Updated 2 months ago
- NEO Series: Native Vision-Language Models from First Principles☆502Updated last month
- (ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automat…☆309Updated 3 months ago
- A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.☆171Updated 5 months ago
- When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification☆820Updated last month
- DocAgent is a system designed to generate high-quality, context-aware code documentation for Python codebases using a multi-agent approac…☆407Updated 7 months ago
- Official implementation of RARE: Retrieval-Augmented Reasoning Modeling☆185Updated 6 months ago
- Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/g…☆354Updated this week
- The repository for the paper titled "Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks"☆158Updated 11 months ago
- [CVPR 2025] Official implementation for "Empowering LLMs to Understand and Generate Complex Vector Graphics" https://arxiv.org/abs/2412.1…☆594Updated 6 months ago
- ☆249Updated last month
- 🔥 OneThinker: All-in-one Reasoning Model for Image and Video☆319Updated last week
- Echo-4o☆458Updated last week
- **Deep Video Discovery (DVD)** is a deep-research style question answering agent designed for understanding extra-long videos.☆318Updated last month
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…☆306Updated 4 months ago
- Source code of LogicRAG at AAAI'26.☆150Updated this week
- MiroThinker is a series of open-source agentic models trained for deep research and complex tool use scenarios.☆1,314Updated last week
- [AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding☆245Updated last month
- RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of aut…☆457Updated last month