使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory
☆58Sep 8, 2024Updated last year
Alternatives and similar repositories for MLLM-Finetuning-Demo
Users that are interested in MLLM-Finetuning-Demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Dead Simple and Modularized Multi-Modal Training and Finetune Framework. Compatible to any LLaVA/Flamingo/QwenVL/MiniGemini etc series …☆19Apr 24, 2024Updated 2 years ago
- 基于ChatGLM3基座模型和LLAMA-Factory框架进行微调的一个中医问答机器人☆113Jan 3, 2024Updated 2 years ago
- Code of the Grounded MUIE model, REAMO☆10Dec 3, 2024Updated last year
- ☆16Aug 1, 2025Updated 9 months ago
- Accompanying code for the paper "Conditional Unscented Autoencoders for Trajectory Prediction"☆16Sep 6, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆37Nov 13, 2024Updated last year
- ☆15Feb 28, 2024Updated 2 years ago
- 小红书网页版助手,一款支持固定在电脑桌面上进行小窗模式浏览阅读、多账户同时登录提升用户活跃度、图片 笔记 视频批量自动化下载等功能的软件助手,让用户在小红书笔记阅读上,获得更开阔的视觉体验和交互享受。☆10Jul 22, 2024Updated last year
- Turn Dify API into OpenAI API schema☆17Aug 16, 2024Updated last year
- Code implementation of "Information Design in Multi-Agent Reinforcement Learning"☆15Aug 18, 2023Updated 2 years ago
- Knowledge Distillation using Contrastive Language-Image Pretraining (CLIP) without a teacher model.☆20Sep 6, 2024Updated last year
- ☆10Nov 29, 2022Updated 3 years ago
- 批量视频分段助手工具,解决剪映处理视频分割太慢了,而且需要一个一个分割视频的困境,生成了一个python写的分段视频提高分辨率增加音量的小工具☆13Aug 25, 2023Updated 2 years ago
- ☆14Jul 21, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Supervised classification to predict rock facies and a T-test flow to evaluate the prediction performance.☆16Dec 31, 2020Updated 5 years ago
- Popular Short Video AI Creative Assistant 爆款视频脚本AI创作助手,结合了最先进的AI技术的视频脚本生成器,专为小红书等平台的视频创作者设计。通过调用Chat GPT-4.0的强大API,为您生成爆款视频标题、内容脚本以及最合适的标…☆12Jun 28, 2024Updated last year
- [AAAI 2025] SSLFusion: Scale and Space Aligned Latent Fusion Model for Multimodal 3D Object Detection☆18Nov 14, 2025Updated 5 months ago
- An official repo for WACV 2025 paper "LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spa…☆29Jan 27, 2025Updated last year
- 使用指令微调对大模型进行微调。☆11Jun 28, 2023Updated 2 years ago
- ☆11Jun 21, 2025Updated 10 months ago
- Lithology identification by using well log data is an initial and fundamental step within petroleum geosciences☆14Jul 13, 2024Updated last year
- 把微信的文件传输助手改造成为个人智能碎片信息收集箱,AI分类,AI总结☆12Jan 31, 2024Updated 2 years ago
- 爬取抖音号下所有视频以及视频描述信息等🤭☆12May 23, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 永久免费 - 企业微信营销 SCRM 系统|AI私域助手☆12Nov 21, 2023Updated 2 years ago
- This is an official implementation of KDAS for Knowledge Distillation Polyp Segmentation (ICME 2024)☆17Oct 3, 2024Updated last year
- Public code repository to reproduce our MICCAI 2022 paper: "Automatic identification of segmentation errors for radiotherapy using geomet…☆11Dec 8, 2022Updated 3 years ago
- ☆10Mar 21, 2022Updated 4 years ago
- 一个基于大模型微调的中文医疗问答机器人应用☆25Jan 11, 2024Updated 2 years ago
- [NeurIPS 2025] Official Implementation for "Glocal Information Bottleneck for Time Series Imputation"☆14Nov 4, 2025Updated 6 months ago
- EHR datasets preprocessing scripts☆11Jan 31, 2024Updated 2 years ago
- 面试辅助系统是一个基于AI的工具,可以将面试官的音频实时转换为文字,并提供合适的回答。支持知识库方案。☆28Mar 25, 2025Updated last year
- ☆13Aug 23, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICCV 2025] Repository for A Quality-Guided Mixture of Score-fusion Experts Framework for Human Recognition☆17Sep 29, 2025Updated 7 months ago
- A generator for golang alfred workflow that helps you create boilerplate code.☆13Jun 26, 2024Updated last year
- The code for ICCV2021 Light Field Saliency Detection with Dual Local Graph Learning and Reciprocative Guidance☆12Apr 1, 2022Updated 4 years ago
- 自动生成短视频,文章自动成片,多模态混剪,数字人,声音克隆☆13Jun 25, 2024Updated last year
- NeurIPS2024-Papers-about-Autonomous-Driving☆19Nov 18, 2024Updated last year
- 基于电商数据微调的Qwen2.5系列的电商大模型,电商数据sft后电商大模型。是https://github.com/leeguandong/EcommerceLLM的升级版本。qwen2.5的效果很好。☆13Oct 4, 2024Updated last year
- material about gaze estimation or gaze tracking for codes, papers and demos.☆10Jul 19, 2021Updated 4 years ago