mdsrqbl / omnihuman
AI model that understands text & humanoids.
☆87Updated 7 months ago
Alternatives and similar repositories for omnihuman:
Users that are interested in omnihuman are comparing it to the libraries listed below
- [CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Gener…☆78Updated last week
- ☆102Updated last month
- YuE: Open Full-song Generation Foundation for the GPU Poor☆350Updated last month
- project page for ChatAnyone☆15Updated this week
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆429Updated 2 weeks ago
- ☆55Updated 2 weeks ago
- HunyuanVideo GP: Large Video Generation Model - GPU Poor version☆380Updated this week
- An AI focused photo manipulation tool based on Gradio☆184Updated last month
- ☆560Updated this week
- Wan 2.1 AI Video Generator Web UI☆22Updated last month
- RunwayML Gen2 and Gen3 unofficial client to generate videos using AI☆70Updated last month
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.☆183Updated 8 months ago
- ☆68Updated this week
- ☆297Updated 9 months ago
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type.☆61Updated 4 months ago
- ☆13Updated 3 months ago
- Open Sourced NoteBookLM☆58Updated 6 months ago
- ☆716Updated last month
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆693Updated 3 weeks ago
- Cosmos1GP for the GPU Poor by DeepBeepMeep☆60Updated last month
- Inference service for Qwen2.5-VL-7b model☆166Updated last week
- Official repository of "TryOffAnyone: Tiled Cloth Generation from a Dressed Person"☆163Updated last month
- A diffusers pipeline for zero shot stylised couples portrait creation☆100Updated 3 months ago
- ☆103Updated 3 weeks ago
- ☆21Updated 4 months ago
- ☆26Updated last year
- Mobius: Text to Seamless Looping Video Generation via Latent Shift☆122Updated 2 weeks ago
- ☆43Updated last year
- Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait☆193Updated last week
- Python GUI using OpenAI to make video stories from real-time Craigslist data☆37Updated last year