RayTang88 / BeautyMaster
We hope to train a VLM to be a beauty master that helps you with dressing and beauty problems.
☆14Updated last month
Related projects:
- A Multi-modal RAG Project with Dataset from Honor of Kings, one of the most popular smart phone games in China☆51Updated 3 weeks ago
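- 八戒-Chat (Bajie-Chat) is a chat language model that imitates the tone of Zhu Bajie, obtained by QLoRA fine-tuning of InternLM on all of Zhu Bajie's lines and dialogue from the "Journey to the West" script, together with related question results generated by Chat-GPT-3.5.☆22Updated last month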
- ☆96Updated 6 months ago
- ☆46Updated 6 months ago
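- Projects and tutorials for MiniCPM and MiniCPM-V, covering six topics: inference, quantization, edge deployment, fine-tuning, technical reports, and applications☆87Updated last week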
- ☆77Updated 4 months ago
- Collection of image and video datasets for generative AI and multimodal visual AI☆17Updated 4 months ago
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆36Updated 8 months ago
- An offline embodied-intelligence guide dog based on the InternLM2 large model☆60Updated 5 months ago
- ☆70Updated last month
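- A role-playing multi-LLM chat room fine-tuned from InternLM2, built on the original text of "Journey to the West", its vernacular translation, and ChatGPT-generated data. This project covers everything about role-playing LLMs, from data acquisition and data processing, to fine-tuning with XTuner and deploying to OpenXLab, to deploying with LMDeploy, with op…☆76Updated 5 months ago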
- ☆64Updated 4 months ago
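- Train a LLaVA model with better Chinese support, and open-source the training code and data.☆21Updated 2 weeks ago
- Notes on multimodal knowledge for large language model (LLM) algorithm (application) engineers☆62Updated 4 months ago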
- Research Code for Multimodal-Cognition Team in Ant Group☆111Updated 2 months ago
- [CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge☆114Updated 2 months ago
- AAAI 2024: Visual Instruction Generation and Correction☆86Updated 7 months ago
- Xtuner Factory☆29Updated 6 months ago
- A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing☆282Updated 2 months ago
- DeepSpeed tutorials & annotated examples & study notes (efficient training of large models)☆94Updated last year
- This is the official implementation of "Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams"☆105Updated last month
- Official repository for the paper "MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning" (https://arxiv.org/abs/2406.17770).☆136Updated last month
- Multimodal (MM) + Chat collection☆187Updated 2 weeks ago
- The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".☆204Updated 7 months ago
- Multimodal chatbot with computer vision capabilities integrated☆98Updated 4 months ago
- Interview questions and interview experiences for programmers applying to major tech companies☆84Updated last month
- Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs☆72Updated 3 months ago
- ☆145Updated 2 months ago
- QiDiHui: RAG, appbuilder, ErnieBot, multi-model, "One Hundred Thousand Whys"☆15Updated last month
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆36Updated 5 months ago