RayTang88 / BeautyMaster
We hope to train a VLM to be a beauty master that helps you with clothing and beauty choices.
☆20Updated 5 months ago
Alternatives and similar repositories for BeautyMaster:
Users that are interested in BeautyMaster are comparing it to the libraries listed below
- Gradio demo used in our Osprey: Pixel Understanding with Visual Instruction Tuning.☆15Updated last year
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Updated 7 months ago
- An initiative to replicate Sora☆104Updated last year
- Chinese Stable Diffusion, zh SD, Chinese text-to-image, Chinese SD☆48Updated last year
- Official Code for GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation☆138Updated 5 months ago
- MLLM @ Game☆12Updated 3 weeks ago
- Xtuner Factory☆33Updated last year
- Building a VLM starting from its basic modules.☆14Updated last year
- A Dead Simple and Modularized Multi-Modal Training and Finetuning Framework. Compatible with any LLaVA/Flamingo/QwenVL/MiniGemini etc series …☆19Updated last year
- [ICLR 2023] Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models☆50Updated last year
- ☆104Updated last year
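- A role-play multi-LLM chat room fine-tuned from InternLM2, built on the original text of Journey to the West, its vernacular Chinese version, and ChatGPT-generated data. This project covers everything about role-play LLMs, from data acquisition and data processing, to fine-tuning with XTuner and deploying to OpenXLab, to deploying with LMDeploy, with op…☆98Updated last year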
- ☆83Updated 11 months ago
- This repository is the official implementation of FLUX-CustomID. It is capable of generating images based on your face image at a level e…☆21Updated 5 months ago
- ☆78Updated 11 months ago
- Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning (https://arxiv.org/abs/2406.17770).☆156Updated 7 months ago
- ☆17Updated 10 months ago
- ☆48Updated 4 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆70Updated 9 months ago
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆50Updated 5 months ago
- AAAI 2024: Visual Instruction Generation and Correction☆92Updated last year
- Official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark☆35Updated last year
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆123Updated 5 months ago
- Our 2nd-gen LMM☆33Updated 11 months ago
- ☆51Updated this week
- Research Code for Multimodal-Cognition Team in Ant Group☆143Updated 9 months ago
- The official code for NeurIPS 2024 paper: Harmonizing Visual Text Comprehension and Generation☆119Updated 5 months ago
- ☆63Updated last month
- Precision Search through Multi-Style Inputs☆68Updated this week
- JoyType: A Robust Design for Multilingual Visual Text Creation☆33Updated 5 months ago