Egbert-Lannister / Robo-ImagineLinks
Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With Geometric And Dynamic Consistency Augmentation"
☆23Updated last month
Alternatives and similar repositories for Robo-Imagine
Users that are interested in Robo-Imagine are comparing it to the libraries listed below
Sorting:
- [2025CVPR] FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation☆26Updated last month
- TorchHook: A PyTorch hooks manager, providing convenient interfaces to capture feature maps and debug models.☆12Updated 3 months ago
- ☆42Updated 2 months ago
- 用户面试平台☆18Updated last month
- vue3-elementPlus-admin,vue3-elementPlus-template☆31Updated this week
- This is a project about visual spatial reasoning.☆53Updated last week
- OTFS-channel-estimation☆26Updated 2 months ago
- Quantify and analyze distribution shifts in learning from samples.☆34Updated 2 weeks ago
- A Unified Driving World Model for Future Generation and Perception☆116Updated last month
- 这个算法用于无人机群避障一个加入机群的无人机,算法分为两种思路:(1)加入者的路径规划主动机动规避编队机群、(2)编队微调避让加入者。目前只做了第一种思路。唯一已知信息是原机群的运动轨迹F(x,y,z,t)|each plane,对于第一种思路:对于补位飞机唯一的输入参数是…☆17Updated last week
- (Preprint) ORV: 4D Occupancy-centric Robot Video Generation.☆59Updated last week
- Implementation for "Challenger: Affordable Adversarial Driving Video Generation"☆114Updated last month
- [ICCV 2025] Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation.☆36Updated 3 weeks ago
- [ACMMM 2025] Officially implement of the paper "DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompti…☆208Updated 3 months ago
- Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"☆176Updated 3 weeks ago
- Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences (ICML 2025)☆24Updated 2 months ago
- SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139☆62Updated 2 months ago
- [CVPR 2025] Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation☆58Updated this week
- (ICCV 2025) Enhance CLIP and MLLM's fine-grained visual representations with generative models.☆70Updated 2 months ago
- [CVPR 2025, All Strong Accept] TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding☆219Updated 2 months ago
- ☆23Updated last week
- A benchmark evaluates LLMs' performance in automating drawing revision tasks.☆57Updated last week
- [ICCV 2025 Highlight] Panorama Generation as a Next-Token Prediction Task.☆44Updated last month
- [Arxiv] Discrete Diffusion in Large Language and Multimodal Models: A Survey☆269Updated last week
- A Gaussian dense reward framework for GUI grounding training☆218Updated last week
- MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE☆363Updated 2 weeks ago
- [CVPR 2025] UniScene: Unified Occupancy-centric Driving Scene Generation☆410Updated 4 months ago
- Official Repository of OmniCaptioner☆159Updated 4 months ago
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆241Updated 3 months ago
- This is the official implementation of UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving☆157Updated last month