EnVision-Research / A4-AgentLinks
A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning
☆28Updated last month
Alternatives and similar repositories for A4-Agent
Users that are interested in A4-Agent are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] AnyI2V: Animating Any Conditional Image with Motion Control Generation☆119Updated 5 months ago
- Code of BRIDGE: Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation☆117Updated 4 months ago
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆87Updated 5 months ago
- [ICCV 2025] Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation☆55Updated 5 months ago
- ☆53Updated 2 months ago
- [ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation☆83Updated 4 months ago
- ☆69Updated 7 months ago
- Official implementation of "Robo-Dopamine: General Process Reward Modeling for High-Precision Robotic Manipulation"☆163Updated 2 weeks ago
- ☆39Updated 3 months ago
- [NeurIPS 2025] Styl3R: Instant 3D Stylized Reconstruction for Arbitrary Scenes and Styles☆102Updated 2 months ago
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation☆86Updated last year
- Local nonlinear causal attention latent diffusion models for visual story synthesizing☆29Updated 10 months ago
- Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"☆282Updated last month
- ☆26Updated 2 months ago
- ☆76Updated 4 months ago
- [NeurIPS 2025 spotlight] QFFT, Question-Free Fine-Tuning for Adaptive Reasoning☆91Updated 3 months ago
- Multimodal Referring Segmentation☆208Updated 2 weeks ago
- RealSee3D: A multi-view RGB-D dataset combining real-world captures and procedurally generated scenes, with extensible annotations for di…☆229Updated last month
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation☆66Updated last year
- 【ICLR 2026】 Official implementation of [OmniNav: A Unified Framework for Prospective Exploration and Visual-Language Navigation]☆71Updated last month
- Official repo for [NeurlPS 2025] "DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration"☆139Updated 9 months ago
- this is a tool and a displayer that allows us to place the 3D model and reshape them.☆14Updated 2 years ago
- ☆43Updated 2 weeks ago
- A Survey of Image Editing☆465Updated 5 months ago
- Code for paper "MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving"☆35Updated 3 weeks ago
- ☆118Updated 3 months ago
- 智能金融投资平台 (Finance Dashboard)☆34Updated last month
- Self-supervised graph diffusion encoder for spatial transcriptomics data (SCOPE-ST).☆31Updated 2 months ago
- build PyTorch with CUDA for Jetson Orin and Thor.☆32Updated 2 months ago
- [NeurIPS 2025] The official implementation of "MOTION: Multi-Sculpt Evolutionary Coarsening for Federated Continual Graph Learning"☆38Updated 2 months ago