0606zt / PanoLlama
Panorama Generation as a Next-Token Prediction Task.
☆19Updated last month
Alternatives and similar repositories for PanoLlama:
Users that are interested in PanoLlama are comparing it to the libraries listed below
- ICCV 2023: Weakly-supervised 3D Pose Transfer with Keypoints☆58Updated last week
- Using reference images to control style in text-to-image diffusion models. Based on CSD and IP Adapter☆53Updated last month
- Official Implementation for "Mask-based modeling for Neural Radiance Fields" (ICLR 2024)☆37Updated 11 months ago
- ☆45Updated 6 months ago
- Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer☆21Updated this week
- The repository for the paper "Image Inversion: A Survey from GANs to Diffusion and Beyond".☆72Updated last week
- Implementation of RSGC-BD (Blur Detection)☆47Updated 8 months ago
- [ECAI 2024] Official code for "TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models".☆28Updated 2 months ago
- For paper "AgileGAN: Stylizing Portraits by Inversion-Consistent Transfer Learning"☆42Updated 2 years ago
- [ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval☆49Updated 2 months ago
- An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator☆97Updated last month
- [Neurips 2023] dynpoint: dynamic neural point for view synthesis☆52Updated last year
- ☆21Updated last year
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data☆39Updated 9 months ago
- SpecRef: A Fast Training-free Baseline of Specific Reference-Condition Real Image Editing☆34Updated last year
- 【 ICLR 2025 】I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength☆108Updated 2 months ago
- Official implementation of "Generating images with 3D annotations using diffusion models".☆47Updated 8 months ago
- High-Speed Spiking Recognition (HSSR)☆9Updated last year
- Official Code of "GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering"☆110Updated 6 months ago
- [ICME 2024] Official Datasets and example of LLM-SAP: Large Language Model Situational Awareness Based Planning☆33Updated last month
- [3DV 2025]🐱🐶🐲🐮🐷Official Implementation of DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer☆66Updated last month
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆157Updated 6 months ago
- MMDepth: Comprehensive MMEngine-based Framework for Monocular, Stereo & Multi-view Depth Estimation☆99Updated 2 months ago
- An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene …☆21Updated last year
- DeDA: Differentiable Image Integration Library☆19Updated last year
- ☆58Updated last year
- [AAAI 2025] Code for paper:Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation☆3Updated 3 months ago
- ☆36Updated last year
- ☆41Updated 2 weeks ago
- ☆43Updated last year