showlab / Show-Anything-3D
Edit and Generate Anything in 3D world!
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Show-Anything-3D
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.☆34Updated last year
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆23Updated 9 months ago
- Official repository for the paper "Images as Weight Matrices: Sequential Image Generation Through Synaptic Learning Rules" (ICLR 2023)☆12Updated last year
- A curated list of papers and resources for text-to-image evaluation.☆26Updated last year
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆27Updated 6 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆32Updated 8 months ago
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax☆18Updated 11 months ago
- 🔥 Aurora Series: A more efficient multimodal large language model series for video.☆47Updated this week
- ☆15Updated 3 months ago
- ☆17Updated 5 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆39Updated 3 months ago
- Official Repository of Personalized Visual Instruct Tuning☆24Updated 2 weeks ago
- Democratising RGBA Image Generation With No $$$ (AI4VA@ECCV24)☆22Updated 2 months ago
- A visual LLM for image region description or QA.☆14Updated last year
- Official PyTorch Implementation of "Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models"☆31Updated last month
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆22Updated 10 months ago
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆26Updated 2 weeks ago
- DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transfer☆14Updated last year
- ImaginaryNet: Learning Object Detectors without Real Images and Annotations☆24Updated last year
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆32Updated 5 months ago
- Code for IterInpaint model, presented in Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation (CVPR 2024 work…☆23Updated 4 months ago
- ☆28Updated 10 months ago
- The code for paper "Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation" which is accepted by AAAI 2022☆10Updated 2 years ago
- ☆19Updated last year
- Accepted by AAAI2022☆21Updated 2 years ago
- [ECCV 2024] Official repository for "DataDream: Few-shot Guided Dataset Generation"☆24Updated 3 months ago
- ☆20Updated last month
- Code release for "SegLLM: Multi-round Reasoning Segmentation"☆35Updated 2 weeks ago