这是一个不基于任何框架实现的从0到1的VLM finetune(包括Pre-train和SFT)
☆40Aug 22, 2025Updated 9 months ago
Alternatives and similar repositories for VLM-Finetuning
Users that are interested in VLM-Finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple project used for Image Classification, including train and predict in Pytorch, do inference in Pytorch C++ API and TensorRT☆18Jun 15, 2020Updated 5 years ago
- ☆16Mar 24, 2025Updated last year
- Classify Traffic Signs.☆10Jan 31, 2017Updated 9 years ago
- 海思设备上部署阉割版yolov5☆13Nov 22, 2021Updated 4 years ago
- official code for Dynamic Smooth Label Assignment☆11Oct 5, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks☆11May 21, 2023Updated 3 years ago
- ☆47Nov 12, 2025Updated 6 months ago
- 在index-tts-vllm的基础上,实现了并提供了模拟流式合成音频的接口服务及客户端测试脚本☆26Sep 2, 2025Updated 8 months ago
- Face++ 是一款基 于 Android 平台开发的创新性 AI 面相分析应用。它巧妙地将中国传统面相学理论(如“三庭五眼”和“十二宫”)与现代人工智能技术相结合,为用户提供一份专业、详尽且富有洞察力的面相分析报告☆22Jul 14, 2025Updated 10 months ago
- ☆12May 19, 2024Updated 2 years ago
- 私有化自动数字人排队训练、短视频排队生成的微信小程序、web运营后台管理系统一键部署,基于单人训练的音频驱动唇形,比wav2lip、deepfacelab、liveportrait、musetalk等等唇形方案更好,直接可以商业化,支持中日英韩多种语音复刻☆62May 10, 2026Updated 2 weeks ago
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 2 years ago
- Just a simple Android app that uses Rokid's CXR-M SDK to upload/sideload an APK onto your Rokid glasses over Wi-Fi. It might be hard to g…☆45Apr 9, 2026Updated last month
- ☆34Nov 18, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 增加了indextts2的简单的界面与api调用方式☆27Oct 27, 2025Updated 6 months ago
- 分类任务的 Focal Loss,PyTorch 实现☆11Jun 13, 2023Updated 2 years ago
- Nonrigid Iterative Closest Point Algorithm☆10Feb 19, 2016Updated 10 years ago
- 3D_lut generate for surround view☆13Jul 31, 2019Updated 6 years ago
- Registration between 3d volume and 2d images.☆10Dec 21, 2018Updated 7 years ago
- ☆15Jan 15, 2024Updated 2 years ago
- Object-Region Video Transformers☆24Mar 24, 2022Updated 4 years ago
- 将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调☆583Sep 8, 2025Updated 8 months ago
- ☆43Mar 23, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model☆93Jul 17, 2025Updated 10 months ago
- ☆12Sep 23, 2022Updated 3 years ago
- VesNet-RL: Simulation-based ReinforcementLearning for Real-World US Probe Navigation☆14Sep 27, 2023Updated 2 years ago
- ☆15Dec 16, 2021Updated 4 years ago
- Visual SLAM from RGB-D data using Microsoft Kinect☆10May 13, 2016Updated 10 years ago
- Parallelize the serial implementation of 3D scene reconstruction with input from kinect sensor and run it on NvidiaGPU using CUDA.☆12Nov 2, 2016Updated 9 years ago
- 自动生成短视频,文章自动成片,多模态混剪,数字人,声音克隆☆13Jun 25, 2024Updated last year
- Arxiv automatically obtains the latest article service.☆11Apr 29, 2020Updated 6 years ago
- Common template for pytorch project. Easy to extent and modify for new project.☆14Dec 13, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for ISBI 2024 paper "Fully Differentiable Correlation-driven 2D/3D Registration for X-Ray to CT Image Fusion"☆10Aug 26, 2024Updated last year
- Pyramid Attention Network for Medical Image Registration (ISBI 2024)☆16Feb 6, 2025Updated last year
- 参考u2net自定义dataset和训练代码训练自己的数据集(基础班本)☆12Apr 20, 2022Updated 4 years ago
- 使用NCNN推理框架和ByteTrack目标跟踪框架,对网络、文件流URL进行实时性视频推理,而UI界面则由Qt框架实现☆24Oct 16, 2024Updated last year
- The improved model for multi-object detection and lane line segmentation based on the YoloP model.☆15Nov 5, 2022Updated 3 years ago
- ☆59Jun 8, 2025Updated 11 months ago
- Region growing for automatic spine segmentation☆11Apr 1, 2020Updated 6 years ago