这是一个不基于任何框架实现的从0到1的VLM finetune(包括Pre-train和SFT)
☆37Aug 22, 2025Updated 6 months ago
Alternatives and similar repositories for VLM-Finetuning
Users that are interested in VLM-Finetuning are comparing it to the libraries listed below
Sorting:
- A simple project used for Image Classification, including train and predict in Pytorch, do inference in Pytorch C++ API and TensorRT☆18Jun 15, 2020Updated 5 years ago
- OpenVLA Lightweight Version(0.5B). It uses qwen2-0.5B and fine-tunes using mllm format, without occupying LLM's inherent tokens. It repre…☆16Jan 7, 2026Updated last month
- ☆18Jan 2, 2026Updated 2 months ago
- 海思设备上部署阉割版yolov5☆13Nov 22, 2021Updated 4 years ago
- Code to BraTS 2023 challenge.☆13May 5, 2025Updated 9 months ago
- ☆12Sep 23, 2022Updated 3 years ago
- ☆46Nov 12, 2025Updated 3 months ago
- 私有化自动数字人排队训练、短视频排队生成的微信小程序、web运营后台管理系统一键部署,基于单人训练的音频驱动唇形,比wav2lip、deepfacelab、liveportrait、musetalk等等唇形方案更好,直接可以商业化,支持中日英韩多种语音复刻☆57Apr 14, 2025Updated 10 months ago
- This repository contains the source code related to the paper Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation☆11Jun 23, 2020Updated 5 years ago
- Registration between 3d volume and 2d images.☆10Dec 21, 2018Updated 7 years ago
- A template for Tensorflow 2.0 + Keras projects☆12Mar 25, 2023Updated 2 years ago
- ☆13Jan 16, 2026Updated last month
- Pyramid Attention Network for Medical Image Registration (ISBI 2024)☆16Feb 6, 2025Updated last year
- Code for ISBI 2024 paper "Fully Differentiable Correlation-driven 2D/3D Registration for X-Ray to CT Image Fusion"☆10Aug 26, 2024Updated last year
- [ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks☆11May 21, 2023Updated 2 years ago
- ☆12Nov 26, 2020Updated 5 years ago
- Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024)☆14Jun 21, 2024Updated last year
- This is a dataset (paired cloud and cloud-free Sentinel-2A image patches)☆11Jul 14, 2025Updated 7 months ago
- API Utility for TOR(The Onion ROUTER) such as requesting a new IP, or generating API password. Uses Network API for control☆12Feb 27, 2025Updated last year
- ☆10Nov 12, 2020Updated 5 years ago
- Multi-modal 3D ultrasound and CT in image-guided spinal surgery: public database and new registration algorithms☆13Mar 9, 2023Updated 2 years ago
- ppt转数字人后台☆17Apr 9, 2025Updated 10 months ago
- Dual Quaternion implementation in python.☆11Nov 30, 2016Updated 9 years ago
- This is a complete online exam system☆10Dec 27, 2019Updated 6 years ago
- Simple python interface to be used with crisp_controllers.☆31Updated this week
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆12Mar 15, 2024Updated last year
- 3D U-NetR: Low Dose Computed Tomography Reconstruction via Deep Learning and 3 Dimensional Convolutions☆12Oct 16, 2022Updated 3 years ago
- Arxiv automatically obtains the latest article service.☆11Apr 29, 2020Updated 5 years ago
- Repository for Master thesis project investigating classification of 3D chest CT scans using Vision Transformer.☆14Aug 29, 2023Updated 2 years ago
- A *pretty useful* practice for tensorflow-2.0 project template architecture.☆11Apr 23, 2019Updated 6 years ago
- 在index-tts-vllm的基础上,实现了并提供了模拟流式合成音频的接口服务及客户端测试脚本☆27Sep 2, 2025Updated 6 months ago
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 2 years ago
- ☆11Mar 15, 2023Updated 2 years ago
- Loop Clousure Detector☆14Feb 2, 2018Updated 8 years ago
- ☆18Jun 14, 2025Updated 8 months ago
- This is an example of creating an AI agent with flowchart☆12Jul 22, 2024Updated last year
- Face++ 是一款基于 Android 平台开发的创新性 AI 面相分析应用。它巧妙地将中国传统面相学理论(如“三庭五眼”和“十二宫”)与现代人工智能技术相结合,为 用户提供一份专业、详尽且富有洞察力的面相分析报告☆21Jul 14, 2025Updated 7 months ago
- official code for Dynamic Smooth Label Assignment☆11Oct 5, 2022Updated 3 years ago
- LDM-Morph: Latent diffusion model guided deformable image registration☆14Jan 19, 2025Updated last year