xxyQwQ / GenAgent

☆120

Related projects ⓘ

Alternatives and complementary repositories for GenAgent

farewellthree / PPLLaVA
Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"
☆94Updated this week
vaew / SkyScript-100M
SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2
☆99Updated this week
TechxGenus / CursorCore
CursorCore: Assist Programming through Aligning Anything
☆67Updated last month
Deaddawn / MovieLLM-code
☆166Updated 4 months ago
qinghew / CharacterFactory
🔥 CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
☆195Updated 4 months ago
HaozheZhao / UltraEdit
☆173Updated 3 months ago
tryonlabs / FLUX.1-dev-LoRA-Outfit-Generator
FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type.
☆42Updated 2 weeks ago
THUDM / CogView3
text to image to generation: CogView3-Plus and CogView3(ECCV 2024)
☆245Updated last month
DiffusionGPT / DiffusionGPT
☆197Updated 10 months ago
aim-uofa / AutoStory
☆145Updated 2 months ago
CharlesGong12 / RECE
[ECCV 2024] Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
☆59Updated 3 weeks ago
gpt4video / GPT4Video
Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation
☆134Updated 3 weeks ago
modelscope / lite-sora
An initiative to replicate Sora
☆99Updated 7 months ago
ClosedCharacter / Peach
我们是第一个完全可商用的角色大模型。
☆36Updated 3 months ago
mulanai / MuLan
MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)
☆127Updated 5 months ago
CodeGoat24 / MagicFace
Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis.
☆51Updated this week
junjiehe96 / UniPortrait
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization
☆202Updated last month
AILab-CVC / SEED-X
Multimodal Models in Real World
☆403Updated 3 weeks ago
Yuanshi9815 / Video-Infinity
Video-Infinity generates long videos quickly using multiple GPUs without extra training.
☆163Updated 3 months ago
OpenGVLab / Diffree
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
☆234Updated 3 months ago
ECNU-CILAB / DiffutoonProjectPage
The project page of Diffutoon
☆26Updated 9 months ago
jwmao1 / story-adapter
A Training-free Iterative Framework for Long Story Visualization
☆61Updated this week
Liuziyu77 / Soda
Search, organize, discover anything!
☆47Updated 7 months ago
okaris / omni-zero-couples
A diffusers pipeline for zero shot stylised couples portrait creation
☆91Updated last month
Vision-CAIR / LongVU
☆278Updated 2 weeks ago
feizc / CogvideX-Interpolation
Keyframe Interpolation with CogvideoX
☆84Updated 3 weeks ago
victorchall / genmoai-smol
The best OSS video generation models
☆121Updated 3 weeks ago
Jeff-LiangF / FlowVid
☆141Updated 4 months ago
cnzzx / VSA
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
☆87Updated 2 weeks ago
design-edit / DesignEdit
Code for DesignEdit
☆309Updated 4 months ago