RayTang88 / BeautyMaster
We aim to train a VLM to be a beauty master that helps you with dressing and beauty choices.
☆19Updated this week
Related projects
Alternatives and complementary repositories for BeautyMaster
- ☆77Updated 6 months ago
- A Training-free Iterative Framework for Long Story Visualization☆62Updated this week
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆26Updated last month
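- A role-playing multi-LLM chat room fine-tuned from InternLM2, built on the original text of Journey to the West, its vernacular translation, and ChatGPT-generated data. This project covers everything about role-playing LLMs, from data acquisition and data processing, to fine-tuning with XTuner and deploying to OpenXLab, to deploying with LMDeploy, with op…☆82Updated 7 months ago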
- Official code repository of “BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training”☆73Updated last month
- ☆83Updated 10 months ago
- ☆69Updated 6 months ago
- Official code for the paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark☆31Updated 10 months ago
- JoyType: A Robust Design for Multilingual Visual Text Creation☆21Updated this week
- The Dawn of Video Generation: Preliminary Explorations with SORA-like Models☆127Updated this week
- ☆51Updated 8 months ago
- Personal project repository: applications of large language models and multimodal models☆123Updated 2 weeks ago
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆42Updated 10 months ago
- ☆52Updated 2 months ago
- Official repository for the paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning (https://arxiv.org/abs/2406.17770).☆148Updated last month
- Research Code for Multimodal-Cognition Team in Ant Group☆123Updated 4 months ago
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals. A model that supports free user input of control…☆111Updated 4 months ago
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆46Updated 4 months ago
- An initiative to replicate Sora☆99Updated 7 months ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual capability to any diffusion model without additional training)☆127Updated 5 months ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆89Updated 2 weeks ago
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation☆50Updated last month
- Building a VLM model starts from the basic module.☆10Updated 7 months ago
- IDM-VTON-training : This is an unofficial training code of idm-vton☆61Updated 3 months ago
- ☆166Updated 4 months ago
- ☆13Updated 5 months ago
- ☆99Updated 8 months ago
- Fine-tune Stable Diffusion with DreamBooth, LoRA, and ControlNet☆51Updated last year
- ☆66Updated last year
- A multimodal RAG project with a dataset from Honor of Kings, one of the most popular smartphone games in China☆51Updated 2 months ago