hyc2026 / StoryTeller
☆23 · Updated last week
Related projects
Alternatives and complementary repositories for StoryTeller
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model ☆39 · Updated 3 months ago
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis ☆20 · Updated last week
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective ☆41 · Updated 3 weeks ago
- Official implementation of Add-SD: Rational Generation without Manual Reference. ☆26 · Updated 3 months ago
- Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner" ☆60 · Updated last month
- ☆33 · Updated 10 months ago
- LiVOS: Light Video Object Segmentation with Gated Linear Matching ☆18 · Updated 2 weeks ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models. ☆40 · Updated last month
- Wire Removal Video Datasets 2 (WRV2) ☆25 · Updated 6 months ago
- Official PyTorch implementation of "Generalized Consistency Trajectory Models for Image Manipulation" ☆31 · Updated 7 months ago
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning" ☆17 · Updated 3 weeks ago
- Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries". ☆47 · Updated 6 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" ☆60 · Updated 6 months ago
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding" ☆30 · Updated 6 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" ☆31 · Updated last week
- Official Repository of Personalized Visual Instruct Tuning ☆24 · Updated 2 weeks ago
- The codes of Siggraph Asia 2024 paper "Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation" ☆33 · Updated 2 months ago
- [ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation ☆56 · Updated 2 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign) ☆57 · Updated last month
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis ☆104 · Updated 10 months ago
- Code release for AccDiffusion (ECCV 2024) ☆69 · Updated this week
- ☆33 · Updated 9 months ago
- The official repo of continuous speculative decoding ☆16 · Updated this week
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation ☆112 · Updated last month
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder ☆48 · Updated this week
- Code release for "SegLLM: Multi-round Reasoning Segmentation" ☆35 · Updated 2 weeks ago
- ☆40 · Updated 11 months ago
- ☆13 · Updated 3 weeks ago
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) ☆52 · Updated last year
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing ☆75 · Updated 7 months ago