RayTang88 / BeautyMaster
We hope to train a VLM to be a beauty master that helps you solve dressing and beauty problems.
☆22Updated last month
Alternatives and similar repositories for BeautyMaster
Users that are interested in BeautyMaster are comparing it to the libraries listed below
- Official Code for GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation☆143Updated last year
- ☆79Updated last year
- Precision Search through Multi-Style Inputs☆73Updated 3 months ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- ☆72Updated 2 years ago
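- A role-playing multi-LLM chat room fine-tuned from InternLM2, built on the original text of Journey to the West, its vernacular Chinese version, and ChatGPT-generated data. This project covers everything about role-playing LLMs, from data acquisition and processing, to fine-tuning with XTuner and deploying to OpenXLab, to deploying with LMDeploy, with op…☆105Updated last year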
- An initiative to replicate Sora☆104Updated last year
- Our 2nd-gen LMM☆34Updated last year
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training)☆143Updated 10 months ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Updated last year
- AAAI 2024: Visual Instruction Generation and Correction☆93Updated last year
- ☆103Updated last year
- ☆28Updated last year
- A Dead Simple and Modularized Multi-Modal Training and Fine-Tuning Framework. Compatible with any LLaVA/Flamingo/QwenVL/MiniGemini etc series …☆19Updated last year
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆256Updated 3 weeks ago
- MLLM @ Game☆14Updated 6 months ago
- Research Code for Multimodal-Cognition Team in Ant Group☆169Updated last month
- ☆116Updated 2 years ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆126Updated last year
- Official repository for the paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning (https://arxiv.org/abs/2406.17770).☆158Updated last year
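- WanJuan-CC is a high-quality dataset built from CommonCrawl through data extraction, rule-based cleaning, deduplication, safety filtering, and quality cleaning.☆15Updated last year
- Training code for Taiyi-Diffusion-XL☆23Updated last year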
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆52Updated last year
- Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)☆169Updated last year
- ☆184Updated 3 months ago
- ☆90Updated last year
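- Trains a LLaVA model with better Chinese support, and open-sources the training code and data.☆76Updated last year
- A Multi-modal RAG Project with Dataset from Honor of Kings, one of the most popular smartphone games in China☆70Updated last year
- Calling large models is now routine in AI projects, but their output is often uncontrollable, while we still need them for various downstream tasks that require controllable, parseable output. We explore a development approach that works closely with Python development.☆29Updated last year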
- ☆72Updated last year