showlab / Show-Anything-3DLinks

Edit and Generate Anything in 3D world!

☆13

Alternatives and similar repositories for Show-Anything-3D

Users that are interested in Show-Anything-3D are comparing it to the libraries listed below

Sorting:

Huage001 / Paint-Anything
An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.
☆35Updated 2 years ago
zhangjiewu / awesome-t2i-eval
A curated list of papers and resources for text-to-image evaluation.
☆29Updated last year
OpenGVLab / Awesome-DragGAN
Awesome-DragGAN: A curated list of papers, tutorials, repositories related to DragGAN
☆85Updated last year
eric-ai-lab / Discffusion
Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"
☆29Updated last year
AniAggarwal / ecad
Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
☆25Updated 3 weeks ago
simonsanvil / DALL-E-Explained
Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…
☆33Updated 2 years ago
neuralchen / Bivolution
Accepted by AAAI2022
☆21Updated 3 years ago
ml-jku / semantic-image-text-alignment
☆24Updated 2 years ago
yumingj / GroupDiff
☆10Updated last year
vgbench / VGBench
☆14Updated 9 months ago
kyegomez / KosmosG
My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"
☆14Updated 8 months ago
facebookresearch / MoCA
Motion-conditional image animation for video editing
☆20Updated last year
top-yun / SPARK
A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.
☆18Updated 6 months ago
aimagelab / Alfie
Democratising RGBA Image Generation With No $$$ (AI4VA@ECCV24)
☆30Updated 10 months ago
jialuli-luka / SELMA
Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
☆34Updated last year
ziplab / SN-Netv2
[ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".
☆27Updated last year
WeihuangLin / INF-LLaVA
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model
☆42Updated 11 months ago
danaesavi / ImageChain
This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…
☆12Updated last month
HanSolo9682 / CounterCurate
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆18Updated last year
ethanhe42 / dds
DDS: Delta Denoising Score PyTorch implementation
☆19Updated last year
aszala / VPEval
VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆45Updated last year
JosephPai / Art-Description
Code for paper <Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation> in ICCV 2021.
☆13Updated 3 years ago
NVlabs / SSOD
Self-Supervised Object Detection via Generative Image Synthesis
☆28Updated 3 years ago
jsonBackup / Latent-CLIP-Demo
☆14Updated 4 months ago
JunjieYang97 / Meta-ControlNet
☆30Updated last year
juletx / spatial-reasoning
Grounding Language Models for Compositional and Spatial Reasoning
☆17Updated 2 years ago
DCDmllm / HyperLLaVA
Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
☆28Updated last year
helblazer811 / oracle-guided-image-synthesis
This is a repository for my work on the paper "Oracle Guided Image Synthesis with Relative Queries".
☆24Updated 3 years ago
kyegomez / MAGVIT2
Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"
☆15Updated 8 months ago
Vision-CAIR / Infinibench
Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows
☆15Updated last month