BUAADreamer / MLLM-Finetuning-DemoView external linksLinks
使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory
☆56Sep 8, 2024Updated last year
Alternatives and similar repositories for MLLM-Finetuning-Demo
Users that are interested in MLLM-Finetuning-Demo are comparing it to the libraries listed below
Sorting:
- ☆14Jul 21, 2023Updated 2 years ago
- An official repo for WACV 2025 paper "LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spa…☆26Jan 27, 2025Updated last year
- A Dead Simple and Modularized Multi-Modal Training and Finetune Framework. Compatible to any LLaVA/Flamingo/QwenVL/MiniGemini etc series …☆19Apr 24, 2024Updated last year
- ☆16Sep 23, 2025Updated 4 months ago
- The source code of [WWW 2025] MoDiCF☆12Jul 12, 2025Updated 7 months ago
- A repository for OpenHack for Lakehouse. The contents are written in Japanese.☆11Nov 20, 2023Updated 2 years ago
- A modern audio editor with multitrack capabilities, enhanced waveform visualization, and an intuitive, sleek interface.☆17Aug 12, 2025Updated 6 months ago
- ☆10Nov 29, 2022Updated 3 years ago
- GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents☆19Jan 26, 2026Updated 2 weeks ago
- A library to query heterogeneous data sources uniformly using SPARQL☆12Dec 5, 2023Updated 2 years ago
- rule matcher (context free grammar)☆10Dec 27, 2019Updated 6 years ago
- 一个支持跨模态大语言模型的webui. A chatbot webui that supports various multi-modal large language models☆11May 8, 2023Updated 2 years ago
- Automatically Detect Out of focus and blurry images☆10Nov 18, 2016Updated 9 years ago
- ☆13Oct 4, 2024Updated last year
- 天池学习赛——街景字符编码识别☆16Apr 5, 2021Updated 4 years ago
- Official repository for the WACV 2024 paper "Multi-view Classification with Hybrid Fusion and Mutual Distillation"☆15Jan 16, 2024Updated 2 years ago
- The FaceFX Unreal Engine 5 plugin.☆10Sep 23, 2025Updated 4 months ago
- Public code repository to reproduce our MICCAI 2022 paper: "Automatic identification of segmentation errors for radiotherapy using geomet…☆11Dec 8, 2022Updated 3 years ago
- YOLOv8 Knowledge Distillation☆10Dec 28, 2024Updated last year
- Code for our TVCG paper "DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera".☆19Aug 22, 2025Updated 5 months ago
- PyTorch implementation of paper "Space-time Neural Irradiance Fields for Free-Viewpoint Video"☆11Jul 29, 2021Updated 4 years ago
- ARCV2.0 updated the package with ARKit 2.0☆11Feb 24, 2019Updated 6 years ago
- Python library for adding visual effects to video streams☆11Dec 20, 2019Updated 6 years ago
- ☆14Nov 25, 2024Updated last year
- Non-negative matrix factorization using MUR, ANLS, ADMM or AO-ADMM.☆10Aug 20, 2023Updated 2 years ago
- Modified version of okvis-1.1.3, include okvis ros wrapper and the core code☆10Sep 18, 2022Updated 3 years ago
- This is a LoRA model finetuned on Wan-I2V-14B-480P. It turns things in the image into fluffy toys.☆19Nov 10, 2025Updated 3 months ago
- ☆11Oct 29, 2025Updated 3 months ago
- Just a helper script for invoking kohya converter (and maybe a cheeky inferencer to check it worked okay)☆11Aug 26, 2023Updated 2 years ago
- [ICCV 2025] Repository for A Quality-Guided Mixture of Score-fusion Experts Framework for Human Recognition☆16Sep 29, 2025Updated 4 months ago
- ☆11Sep 15, 2016Updated 9 years ago
- 神经辐射场 论文学习☆10Sep 25, 2021Updated 4 years ago
- ☆11Jan 16, 2025Updated last year
- This is a pytorch implement of SCDA (selective deep descriptors aggregation for fine-grained image retrieval), which is fully translated …☆20Mar 8, 2022Updated 3 years ago
- Deformable DETR: Deformable Transformers for End-to-End Object Detection. This is an alternative for running custom datasets on Deformabl…☆14Jan 24, 2022Updated 4 years ago
- Code for our CICAI 2022 paper "3D Face Cartoonizer: Generating Personalized 3D Cartoon Faces from 2D Real Photos with a Hybrid Dataset".☆10Aug 9, 2022Updated 3 years ago
- LLM-based character segmentation agent for ComfyUI based on SAM 3 and the SAM 3 Agent notebook☆25Dec 22, 2025Updated last month
- ☆18Aug 1, 2025Updated 6 months ago
- LoRa localization using a drone swarm. Master project at the Swisscom Digital Lab supervised by LIS-EPFL. February-August 2019.☆12Jul 23, 2019Updated 6 years ago