LeapLabTHU / EchoWorldLinks
[CVPR 2025] EchoWorld: Learning Motion-Aware World Models for Echocardiography Probe Guidance
☆39Updated 9 months ago
Alternatives and similar repositories for EchoWorld
Users that are interested in EchoWorld are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning☆35Updated 9 months ago
- [NeurIPS'23] Uncertainty Estimation for Safety-critical Scene Segmentation via Fine-grained Reward Maximization☆18Updated last year
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding"☆59Updated 6 months ago
- ☆59Updated last year
- [ICCV 2025] MRGen: Segmentation Data Engine for Underrepresented MRI Modalities☆37Updated 3 months ago
- ☆21Updated last month
- ☆10Updated 2 years ago
- Official implementation of "Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Ro…☆24Updated last year
- Implementation of ''VPUFormer: Visual Prompt Unified Transformer for Interactive Image Segmentation''☆15Updated 4 months ago
- ICLR 2023 and ICML 2023 paper☆22Updated last year
- [MedIA 2025] MambaMIM: Pre-training Mamba with State Space Token Interpolation and its Application to Medical Image Segmentation☆40Updated 5 months ago
- [MICCAI 2025] Official code implementation for paper: ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tra…☆34Updated 2 months ago
- [NeurIPS 2023] Text Promptable Surgical Instrument Segmentation with Vision-Language Models☆43Updated 2 years ago
- Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approach☆18Updated 2 months ago
- [TMI'22]Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation☆23Updated 3 years ago
- Code repository for paper: "General surgery vision transformer: A video pre-trained foundation model for general surgery"☆45Updated last year
- [IPCAI'24 Best Paper] Advancing Surgical VQA with Scene Graph Knowledge☆45Updated 7 months ago
- The Official PyTorch Implementation of OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation☆34Updated last year
- ☆35Updated last year
- GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI.☆85Updated 7 months ago
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆90Updated 7 months ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆59Updated last year
- Pytorch Implementation for CVPR 2024 paper: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation☆56Updated 4 months ago
- Code for [MICCAI 2024] MoreStyle: Relax Low-frequency Constraint of Fourier-based Image Reconstruction in Generalizable Medical Image Seg…☆10Updated last year
- [NeurIPS'25][OralGPT & MMOral] The official repo of OralGPT & MMOral Bench.☆60Updated last week
- [ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning☆45Updated last month
- [ICML2024]The official implementation of SemiRES in PyTorch.☆32Updated last year
- [ICRA 2025] Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion☆22Updated 7 months ago
- [MedIA 2026] Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation☆26Updated last week
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.☆24Updated last year