xxyQwQ / GenAgent
☆114Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for GenAgent
- CursorCore: Assist Programming through Aligning Anything☆64Updated 3 weeks ago
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2☆97Updated 2 months ago
- ☆165Updated 4 months ago
- ☆196Updated 9 months ago
- ☆168Updated 3 months ago
- 🔥 CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models☆194Updated 3 months ago
- text to image to generation: CogView3-Plus and CogView3(ECCV 2024)☆241Updated 3 weeks ago
- ☆145Updated 2 months ago
- An initiative to replicate Sora☆98Updated 6 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆71Updated 6 months ago
- Search, organize, discover anything!☆47Updated 6 months ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆127Updated 5 months ago
- Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D…☆30Updated this week
- Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model☆233Updated 3 months ago
- A Training-free Iterative Framework for Long Story Visualization☆58Updated last month
- Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation☆132Updated last week
- ☆72Updated 10 months ago
- Multimodal Models in Real World☆400Updated last week
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.☆166Updated 3 months ago
- Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)☆140Updated 3 months ago
- Keyframe Interpolation with CogvideoX☆79Updated last week
- ☆123Updated last week
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆55Updated this week
- The project page of Diffutoon☆26Updated 9 months ago
- An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community …☆54Updated this week
- Official repository of In-Context LoRA for Diffusion Transformers☆362Updated this week
- The best OSS video generation models☆117Updated 2 weeks ago
- gradio WebUI for AdvancedLivePortrait☆125Updated this week
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆163Updated 3 months ago