A Dead Simple and Modularized Multi-Modal Training and Finetune Framework. Compatible to any LLaVA/Flamingo/QwenVL/MiniGemini etc series models.
☆19Apr 24, 2024Updated last year
Alternatives and similar repositories for MLLM_Factory
Users that are interested in MLLM_Factory are comparing it to the libraries listed below
Sorting:
- Unofficial implementation of 'Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator'☆10Dec 10, 2024Updated last year
- ☆14Aug 5, 2025Updated 7 months ago
- ☆42Sep 2, 2023Updated 2 years ago
- ☆38Oct 20, 2023Updated 2 years ago
- Code for ReMoS: 3D-Motion Conditioned Reaction Synthesis for Two-person Interactions (ECCV 2024)☆34Mar 4, 2025Updated last year
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆33Aug 16, 2023Updated 2 years ago
- ☆18Sep 23, 2025Updated 5 months ago
- This is the code repository for the paper: Hand-Object Interaction Controller (HOIC): Deep Reinforcement Learning for Reconstructing Inte…☆37Jul 4, 2024Updated last year
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆41Jun 22, 2024Updated last year
- Token classification using Phobert Models for Vietnamese☆13Jul 8, 2022Updated 3 years ago
- ☆11Mar 11, 2024Updated last year
- Apparel Classification for Indian Ethnic Clothes☆12Feb 10, 2023Updated 3 years ago
- This repo contains the code to reproduce figures in my dissertation "Passive Imaging and Characterization of the Subsurface With Distribu…☆10Jun 14, 2018Updated 7 years ago
- Official Code for Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning☆16Jul 24, 2025Updated 7 months ago
- Tensorflow implementation of the paper "Fast Compressive Sensing Using Generative Model with Structed Latent Variables"☆10Apr 7, 2020Updated 5 years ago
- The sparse Bayesian learning sandbox☆11Jul 4, 2021Updated 4 years ago
- Recreating the phase functioned neural network in unreal engine 5☆15May 12, 2024Updated last year
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Sep 9, 2024Updated last year
- [ACM MM 2024 (Oral)] Official PyTorch Implementation of Paper "MovingColor: Seamless Fusion of Fine-grained Video Color Enhancement"☆11Dec 30, 2024Updated last year
- [ACM MM 2025] LMM4Edit: Benchmarking and Evaluating Multimodal Image Editing with LMMs☆15Feb 10, 2026Updated 3 weeks ago
- ☆10Jul 2, 2021Updated 4 years ago
- ☆11Feb 24, 2023Updated 3 years ago
- ☆21Dec 11, 2025Updated 2 months ago
- [Advanced Photonics Research, 2021] Control tightly focused fields via manipulating pupil functions☆10Dec 25, 2024Updated last year
- Source code for the article "Animation Blend Spaces without Triangulation"☆37Mar 2, 2024Updated 2 years ago
- STDFormer: Spatio Temporal Disentanglement Learning for 3D Human Mesh Recovery from Monocular Videos with Transformer☆45Mar 14, 2024Updated last year
- ☆88Jul 4, 2024Updated last year
- ☆48Jun 26, 2025Updated 8 months ago
- ☆11Apr 10, 2019Updated 6 years ago
- VMDのモーフデータをFBXに変換するためのプロジェクト☆11Dec 10, 2025Updated 2 months ago
- Fastai+PyTorch implementation of sparse model training methods (SET, SNFS, RigL) + customize-your-own.☆10Oct 20, 2022Updated 3 years ago
- ☆11Jan 3, 2023Updated 3 years ago
- [CVPR 2022] Code for the paper "Quantization-aware Deep Optics for Diffractive Snapshot Hyperspectral Imaging".☆16Oct 6, 2022Updated 3 years ago
- Prediction of glycopeptide fragment mass spectra by deep learning☆10Feb 20, 2024Updated 2 years ago
- Colour Manga using AI☆14Apr 8, 2025Updated 10 months ago
- Text Detection by RetinaNet with PyTorch (Code will be released soon)☆10Dec 1, 2018Updated 7 years ago
- ☆14Mar 23, 2023Updated 2 years ago
- Finetune the controlnet+stable diffusion model using diffuser☆11Sep 18, 2023Updated 2 years ago
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"☆10May 15, 2024Updated last year