A from-scratch (0 to 1) VLM fine-tuning implementation that does not rely on any framework (covers both pre-training and SFT)
☆39Aug 22, 2025Updated 8 months ago
Alternatives and similar repositories for VLM-Finetuning
Users that are interested in VLM-Finetuning are comparing it to the libraries listed below.
- A simple image classification project, including training and prediction in PyTorch and inference with the PyTorch C++ API and TensorRT☆18Jun 15, 2020Updated 5 years ago
- ☆16Mar 24, 2025Updated last year
- Deploying a stripped-down version of yolov5 on HiSilicon devices☆13Nov 22, 2021Updated 4 years ago
- official code for Dynamic Smooth Label Assignment☆11Oct 5, 2022Updated 3 years ago
- ☆47Nov 12, 2025Updated 5 months ago
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆12Mar 15, 2024Updated 2 years ago
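- Built on index-tts-vllm; implements and provides an API service for simulated streaming audio synthesis, along with client test scripts☆26Sep 2, 2025Updated 8 months ago
- One-click deployment of a self-hosted WeChat mini program and web operations admin system for queued automatic digital-human training and queued short-video generation; audio-driven lip sync based on single-person training, described as better than lip-sync solutions such as wav2lip, deepfacelab, liveportrait, and musetalk; ready for commercial use; supports voice cloning in Chinese, Japanese, English, and Korean☆59Apr 14, 2025Updated last year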
- Loop Closure Detector☆13Feb 2, 2018Updated 8 years ago
- Simple python interface to be used with crisp_controllers.☆35Apr 14, 2026Updated 3 weeks ago
- The Official PyTorch implementation of "Part Aware Contrastive Learning for Self-Supervised Action Recognition" in IJCAI 2023☆13Nov 9, 2023Updated 2 years ago
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 2 years ago
- Happy Hacking With Claude!!!☆25Oct 27, 2025Updated 6 months ago
- Just a simple Android app that uses Rokid's CXR-M SDK to upload/sideload an APK onto your Rokid glasses over Wi-Fi. It might be hard to g…☆42Apr 9, 2026Updated 3 weeks ago
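- Adds a simple UI and an API calling interface for indextts2☆27Oct 27, 2025Updated 6 months ago
- Focal Loss for classification tasks, implemented in PyTorch☆11Jun 13, 2023Updated 2 years ago
- AI-driven virtual digital-human live-streaming system with modular development support for 2D/3D digital humans, TTS, ASR, lip sync, stream pushing, interaction, and more.☆24May 13, 2025Updated 11 months ago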
- ☆10Nov 12, 2020Updated 5 years ago
- Registration between 3d volume and 2d images.☆10Dec 21, 2018Updated 7 years ago
- Splices the vision head of SmolVLM2 onto the Qwen3-0.6B model and fine-tunes the combined model☆572Sep 8, 2025Updated 7 months ago
- ☆15Jan 15, 2024Updated 2 years ago
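- An open-source multimodal AI search project that combines large language models (LLMs), multi-source search engines, and a multi-agent architecture to deliver a next-generation, question-answering-style search experience☆17Mar 26, 2025Updated last year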
- Multi-modal 3D ultrasound and CT in image-guided spinal surgery: public database and new registration algorithms☆13Mar 9, 2023Updated 3 years ago
- [ACM MM 2025] LIDAR: Lightweight Adaptive Cue-Aware Fusion Vision Mamba for Multimodal Segmentation of Structural Cracks☆22Nov 18, 2025Updated 5 months ago
- ☆43Mar 23, 2026Updated last month
- ☆12Sep 23, 2022Updated 3 years ago
- VesNet-RL: Simulation-based Reinforcement Learning for Real-World US Probe Navigation☆14Sep 27, 2023Updated 2 years ago
- This sample shows how to deploy an industrial computer vision model to detect real world analog pointer meters and extract corresponding …☆12Sep 23, 2022Updated 3 years ago
- ☆10Mar 1, 2021Updated 5 years ago
- ☆15Dec 16, 2021Updated 4 years ago
- Visual SLAM from RGB-D data using Microsoft Kinect☆10May 13, 2016Updated 9 years ago
- Parallelize the serial implementation of 3D scene reconstruction with input from a Kinect sensor and run it on an NVIDIA GPU using CUDA.☆12Nov 2, 2016Updated 9 years ago
- Common template for PyTorch projects. Easy to extend and modify for new projects.☆13Dec 13, 2022Updated 3 years ago
- Dual quaternion implementation in Python.☆11Nov 30, 2016Updated 9 years ago
- Pyramid Attention Network for Medical Image Registration (ISBI 2024)☆16Feb 6, 2025Updated last year
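- Uses the NCNN inference framework and the ByteTrack object-tracking framework for real-time video inference on network and file stream URLs, with the UI implemented in Qt☆24Oct 16, 2024Updated last year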
- The improved model for multi-object detection and lane line segmentation based on the YoloP model.☆15Nov 5, 2022Updated 3 years ago
- ☆59Jun 8, 2025Updated 10 months ago
- Region growing for automatic spine segmentation☆11Apr 1, 2020Updated 6 years ago