0606zt / PanoLlamaLinks
Panorama Generation as a Next-Token Prediction Task.
☆20Updated 2 months ago
Alternatives and similar repositories for PanoLlama
Users that are interested in PanoLlama are comparing it to the libraries listed below
Sorting:
- ICCV 2023: Weakly-supervised 3D Pose Transfer with Keypoints☆58Updated last month
- ☆45Updated 7 months ago
- Official Implementation for "Mask-based modeling for Neural Radiance Fields" (ICLR 2024)☆37Updated last year
- Using reference images to control style in text-to-image diffusion models. Based on CSD and IP Adapter☆53Updated 3 months ago
- Official Code of Logits-Based-Finetuning☆85Updated last week
- The repository for the paper "Image Inversion: A Survey from GANs to Diffusion and Beyond".☆74Updated last month
- Image and video Tokenizer/VAE selection guide, text and face reconstruction evaluation.☆70Updated 3 weeks ago
- [ECAI 2024] Official code for "TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models".☆28Updated 4 months ago
- SpecRef: A Fast Training-free Baseline of Specific Reference-Condition Real Image Editing☆35Updated last year
- [3DV 2025]🐱🐶🐲🐮🐷Official Implementation of DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer☆66Updated 3 months ago
- An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator☆97Updated 2 months ago
- ☆44Updated 2 months ago
- [Neurips 2023] dynpoint: dynamic neural point for view synthesis☆52Updated last year
- ☆107Updated this week
- Official implementation of "Generating images with 3D annotations using diffusion models".☆49Updated 10 months ago
- [ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval☆49Updated 4 months ago
- For paper "AgileGAN: Stylizing Portraits by Inversion-Consistent Transfer Learning"☆43Updated 2 years ago
- 【 ICLR 2025 】I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength☆109Updated 3 months ago
- Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer☆114Updated last month
- ☆82Updated 2 weeks ago
- Flattening-Net: Deep Regular 2D Representation for 3D Point Cloud Analysis☆62Updated 6 months ago
- Implementation of RSGC-BD (Blur Detection)☆47Updated 10 months ago
- Disentangled Implicit Content and Rhythm Learning for Diverse Co-Speech Gestures Synthesis [ACMMM 2022]☆27Updated last year
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data☆39Updated 11 months ago
- ☆21Updated last year
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆231Updated last month
- [AAAI 2025] Code for paper:Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation☆3Updated 5 months ago
- An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene …☆21Updated last year
- ☆14Updated last month
- DeDA: Differentiable Image Integration Library☆19Updated last year