HKUDS / VideoAgent
"VideoAgent: All-in-One Agentic Framework for Video Understanding, Editing, and Remaking"
☆408 Updated 3 months ago
Alternatives and similar repositories for VideoAgent
Users who are interested in VideoAgent are comparing it to the libraries listed below:
- [EMNLP2025] "GraphAgent: Agentic Graph Language Assistant" ☆337 Updated 11 months ago
- A general AI agent framework that can be adapted to various tasks and environments. ☆150 Updated 11 months ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414 ☆489 Updated 3 months ago
- (ACL-2025 main conference) SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automat… ☆316 Updated 5 months ago
- Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents ☆394 Updated 11 months ago
- Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience" ☆219 Updated 5 months ago
- [ACL 2025 Findings] MegaAgent: A Large-Scale Autonomous LLM-based Multi-Agent System Without Predefined SOPs https://aclanthology.org/202… ☆235 Updated 2 months ago
- When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification ☆840 Updated 2 months ago
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator" ☆560 Updated 6 months ago
- Source code and utilities for the Genesys distributed language model architecture discovery system. ☆164 Updated 3 months ago
- A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories. ☆178 Updated 6 months ago
- Official implementation of RARE: Retrieval-Augmented Reasoning Modeling ☆186 Updated 8 months ago
- LightAgent: Lightweight AI agent framework with memory, tools & tree-of-thought. Supports multi-agent collaboration, self-learning, and m… ☆468 Updated last week
- Tree Search for LLM Agent Reinforcement Learning ☆273 Updated 4 months ago
- Babel - Open Multilingual Large Language Models Serving Over 90% of Global Speakers ☆212 Updated 10 months ago
- RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of aut… ☆480 Updated 2 months ago
- Echo-4o ☆479 Updated last month
- DocAgent is a system designed to generate high-quality, context-aware code documentation for Python codebases using a multi-agent approac… ☆413 Updated 9 months ago
- The repository for the paper titled "Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks" ☆158 Updated last year
- ✨ WithAnyone is capable of generating high-quality, controllable, and ID consistent images ☆547 Updated last month
- Scaling In-context Learning from Few-shot to 1,024-shot on Tabular ML ☆59 Updated last month
- ARGO is an open-source AI Agent platform that brings Local Manus to your desktop. With one-click model downloads, seamless closed LLM int… ☆462 Updated 3 weeks ago
- [NeurIPS 2025] A Graph-based LLM Framework for Real-world SE Tasks ☆517 Updated 4 months ago
- [EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (RLVR for Search with Minimal Data) ☆811 Updated 2 months ago
- A curated list of awesome leaderboard-oriented resources for AI domain ☆302 Updated 2 weeks ago
- [AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding ☆298 Updated 2 months ago
- NEO Series: Native Vision-Language Models from First Principles ☆633 Updated 3 weeks ago
- [arXiv'25] EraRAG: Efficient and Incremental Retrieval-Augmented Generation for Growing Corpora ☆169 Updated 4 months ago
- Open-source SOTA multi-image editing model ☆845 Updated this week
- ☆253 Updated last month