amap-cvlab / MV-PainterLinks
☆244Updated 3 weeks ago
Alternatives and similar repositories for MV-Painter
Users that are interested in MV-Painter are comparing it to the libraries listed below
Sorting:
- Coherent Video Inpainting Using Optical Flow-Guided Efficient Diffusion☆287Updated 2 months ago
- (AAAI 2025)MUSES: 3D-Controllable Image Generation via Multi-Modal Agent Collaboration☆34Updated last month
- JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent☆499Updated this week
- PixelHacker: Image Inpainting with Structural and Semantic Consistency☆434Updated last month
- SCoralDet and SCoralDet Dataset☆123Updated 2 months ago
- BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence☆215Updated last month
- A fast gigapixel processing system☆2,008Updated 7 months ago
- ☆306Updated last week
- AIDoctor training medical GPT model with ChatGPT training pipeline, implemantation of Pretraining, Supervised Finetuning, RLHF(Reward Mod…☆274Updated 4 months ago
- [ECAI 2024] MoSt-DSA: Modeling Motion and Structural Interactions for Direct Multi-Frame Interpolation in DSA Images☆11Updated 7 months ago
- [Arxiv 2025] Official Implementation for "A Novel Benchmark and Dataset for Efficient 3D Gaussian Splatting with Gaussian Point Cloud Com…☆54Updated 2 weeks ago
- ☆566Updated last month
- Moxin is a family of fully open-source and reproducible LLMs☆598Updated 2 weeks ago
- 生产级iOS网络通信、架构实战 基于 CocoaAsyncSocket 打造的高性能底层通信框架,日均处理10万+消息,真实服务于企业客户!来源于多年IM开发经验总结,经过生产环境验证(已脱敏),完整呈现从单TCP架构到企业级多路复用架构的演进之路。☆741Updated last week
- Native-resolution diffusion Transformer☆274Updated last month
- EPRecon: An Efficient Framework for Real-Time Panoptic 3D Reconstruction from Monocular Video (ICRA2025)☆301Updated 7 months ago
- R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization☆403Updated 2 weeks ago
- ☆204Updated 3 months ago
- EDA-Q is a full-stack EDA tool for superconducting quantum chip design, including topology optimization, equivalent circuit calculation, …☆532Updated last week
- Segment Anything 2 for Surgical Video Segmentation☆387Updated 3 months ago
- Multi-Reward as Condition for Instruction-Based Image Editing☆52Updated 3 months ago
- 【CVPR 2025 Highlight】MonSter: Marry Monodepth to Stereo Unleashes Power☆617Updated last week
- [CVPR 2025] The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation☆72Updated 3 weeks ago
- [NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding☆150Updated 4 months ago
- [CVPR‘ 2025 ] JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration☆202Updated last week
- Official implementation of OpenWBT.☆654Updated 3 weeks ago
- Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?".☆133Updated 2 weeks ago
- [ICLR 2025] CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs☆123Updated last month
- (ICLR 2025) The official pytorch implementation of "UNIP: Rethinking Pre-trained Attention Patterns for Infrared Semantic Segmentation"☆21Updated 2 months ago
- Oriented Bounding Box (OBB) -based Instance Segmentation (MIR 2025)☆82Updated 7 months ago