Research works from Tencent AI Lab regarding self-evolving agents
☆86Jan 30, 2026Updated last month
Alternatives and similar repositories for SelfEvolvingAgent
Users that are interested in SelfEvolvingAgent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models☆48Jul 7, 2025Updated 8 months ago
- A fast and neat API for Conceptualization of Probase☆17Oct 28, 2019Updated 6 years ago
- Code for Negative Yields Positive: Unified Dual-Path Adapter for Vision-Language Models☆26Oct 29, 2024Updated last year
- [CVPR 2025] PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Models☆57Jan 30, 2026Updated last month
- More reliable Video Understanding Evaluation☆14Sep 23, 2025Updated 6 months ago
- ☆10Oct 25, 2024Updated last year
- OmniGAIA: Towards Native Omni-Modal AI Agents☆82Mar 16, 2026Updated last week
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents☆103Mar 10, 2026Updated 2 weeks ago
- Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"☆59Oct 14, 2025Updated 5 months ago
- Code for the paper "Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering" (AAAI 2021)☆30Feb 19, 2021Updated 5 years ago
- [NeurIPS'25] ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding☆50Sep 21, 2025Updated 6 months ago
- Code release for our paper "Divide and Conquer: Language Models can Plan and Self-Correct for Compositional Text-to-Image Generation".☆18Jan 30, 2024Updated 2 years ago
- [IJCAI'25 Workshop Oral] The 1st place solution of IJCAI 2025 challenge track 1: Image Detection and Localization☆35Dec 4, 2025Updated 3 months ago
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆18Jun 19, 2025Updated 9 months ago
- Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents☆28Nov 24, 2025Updated 4 months ago
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models☆25Apr 14, 2025Updated 11 months ago
- [AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models☆39Jan 27, 2026Updated last month
- [CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models☆74Dec 1, 2025Updated 3 months ago
- ☆27Jan 28, 2026Updated last month
- Fast, memory-efficient attention column reduction (e.g., sum, mean, max)☆42Feb 10, 2026Updated last month
- ☆18Mar 31, 2024Updated last year
- HallE-Control: Controlling Object Hallucination in LMMs☆31Apr 10, 2024Updated last year
- [ICLR 2026 Oral] FlashVID: Efficient Video Large Language Models via Training-free Tree-based Spatiotemporal Token Merging☆44Mar 16, 2026Updated last week
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model☆37Jan 8, 2025Updated last year
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆58Oct 16, 2025Updated 5 months ago
- ☆16May 23, 2023Updated 2 years ago
- Official code repository for the main conference paper in EMNLP 2022: SubeventWriter: Iterative Sub-event Sequence Generation with Cohere…☆11Oct 16, 2022Updated 3 years ago
- [AAAI-2025] The offical code for SiTo (Similarity-based Token Pruning for Stable Diffusion Models)☆44Jun 2, 2025Updated 9 months ago
- ☆11Feb 22, 2023Updated 3 years ago
- ☆13Dec 9, 2024Updated last year
- [ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"☆55Oct 9, 2025Updated 5 months ago
- ☆23Jun 5, 2025Updated 9 months ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆24Mar 4, 2025Updated last year
- [ICLR 2026] A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.☆181Jul 6, 2025Updated 8 months ago
- Coursework for Mathematics for Machine Learning (70015) at Imperial College London☆10Nov 12, 2024Updated last year
- SpyGame: An interactive multi-agent framework to evaluate intelligence with large language models :D☆15Nov 9, 2023Updated 2 years ago
- Source code of the paper: Exploring Multi-View Pixel Contrast for General and Robust Image Forgery Localization, IEEE TIFS 2025.☆26Aug 8, 2025Updated 7 months ago
- Universal Video Temporal Grounding with Generative Multi-modal Large Language Models☆47Nov 25, 2025Updated 4 months ago
- ☆35Jun 3, 2025Updated 9 months ago