cfeng16/GPS2Pix

[CVPR 2025] GPS as a Control Signal for Image Generation

☆25

Alternatives and similar repositories for GPS2Pix

Users who are interested in GPS2Pix are comparing it to the repositories listed below.

wufeim / LychSim
A controllable and interactive simulation framework for vision research.
☆16 · May 25, 2026 · Updated last month
adobe-research / llava-score
☆11 · Oct 2, 2024 · Updated last year
Jyxarthur / shot-by-shot
[ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…
☆24 · May 16, 2026 · Updated 2 months ago
google-deepmind / wyd-benchmark
☆28 · Mar 3, 2025 · Updated last year
lambert-x / VideoAuteur
VideoAuteur: Towards Long Narrative Video Generation
☆44 · Oct 22, 2025 · Updated 9 months ago
kaist-ami / FPRF
[AAAI'24] Official PyTorch implementation of the paper "FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radianc…
☆18 · Nov 29, 2024 · Updated last year
snap-research / VIMI
☆13 · Jul 10, 2024 · Updated 2 years ago
cvlab-kaist / Vid-CamEdit
Official Implementation of "Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry"
☆32 · Nov 10, 2025 · Updated 8 months ago
cvlab-kaist / DiffTrack
[NeurIPS'25] Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"
☆99 · Dec 3, 2025 · Updated 7 months ago
Taeyoung96 / ROS2-Docker-tutorial
ROS2 Docker tutorial with VSCode
☆13 · Sep 5, 2023 · Updated 2 years ago
tyshiwo1 / Awesome-Visual-Tokenizer
Awesome Visual Tokenizers/Autoencoders
☆20 · Nov 19, 2025 · Updated 8 months ago
ziqipang / ADDP
[ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
☆15 · Jul 4, 2025 · Updated last year
yigu1008 / Diffusion-RPO
☆15 · Mar 30, 2025 · Updated last year
SilentView / EMCID
Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"
☆19 · Mar 21, 2024 · Updated 2 years ago
3dlg-hcvc / vigil3d
ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding
☆20 · Aug 8, 2025 · Updated 11 months ago
Qichuzyy / POA
Official implementation of ECCV24 paper: POA
☆24 · Aug 8, 2024 · Updated last year
NarcissusEx / VividDreamer
☆17 · Feb 20, 2025 · Updated last year
NVlabs / zero-msf
[CVPR 2025] ZeroMSF: Zero-shot Monocular Scene Flow Estimation in the Wild
☆43 · Sep 16, 2025 · Updated 10 months ago
zllrunning / deeplab-pytorch-crf
PyTorch implementation of DeepLab v2 (ResNet) + COCO-Stuff 10k/164k
☆15 · Nov 7, 2018 · Updated 7 years ago
GeunminHwang / DiffuseSlide
Official implementation of DiffuseSlide
☆17 · Jun 30, 2025 · Updated last year
CuiRuikai / NumGrad-Pull
☆12 · Jan 16, 2025 · Updated last year
kaist-ami / LaughTalk
☆14 · Dec 8, 2025 · Updated 7 months ago
VidCapBench / VidCapBench
☆13 · May 17, 2025 · Updated last year
KaiyueSun98 / T2I-Personalization-with-AR
☆47 · Apr 20, 2025 · Updated last year
TerminologyHub / termhub-in-5-minutes
Developer project for getting basic API integrations working in under 5 minutes
☆11 · May 22, 2026 · Updated 2 months ago
zhentao-zou / MURE
Beyond Textual CoT: Interleaved Text-image chains with Deep Confidence Reasoning for Image Editing
☆19 · Jun 24, 2026 · Updated last month
MattWallingford / 360-1M
☆94 · May 30, 2025 · Updated last year
UVA-Computer-Vision-Lab / FrameINO
[NeurIPS 2025] Frame In-N-Out: Unbounded Controllable Image-to-Video Generation
☆33 · May 1, 2026 · Updated 2 months ago
llyx97 / FETV
[NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L…
☆56 · Mar 4, 2024 · Updated 2 years ago
CeeZh / SILVR
Official Implementation for "SiLVR: A Simple Language-based Video Reasoning Framework"
☆19 · Jan 18, 2026 · Updated 6 months ago
amitakamath / vl_text_encoders_are_bottlenecks
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11 · May 24, 2023 · Updated 3 years ago
nishadsinghi / sc-genrm-scaling
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…
☆15 · Oct 31, 2025 · Updated 8 months ago
kaist-ami / Uni-DVPS
[RA-L'24, IROS'24] Official PyTorch Implementation of "Uni-DVPS: Unified Model for Depth-Aware Video Panoptic Segmentation"
☆13 · Oct 11, 2024 · Updated last year
PixCtrol / PixelWizard
PixelWizard: Towards Efficient High-Fidelity Video Generation at Ultra-Large Spatial Resolutions
☆26 · May 26, 2026 · Updated last month
hi-zhengcheng / vividzoo
☆39 · Oct 19, 2024 · Updated last year
UCSB-AI / via-video
☆25 · May 12, 2026 · Updated 2 months ago
alimohammadiamirhossein / cora
✨ PyTorch implementation of "Cora: Correspondence-aware Image Editing Using Few-Step Diffusion", accepted at SIGGRAPH 2025.
☆35 · Jun 3, 2025 · Updated last year
LgQu / TIGeR
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16 · Updated this week
sjz5202 / LLaVA-Reward
Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation
☆26 · Jul 30, 2025 · Updated 11 months ago