这是一个不基于任何框架实现的从0到1的VLM finetune(包括Pre-train和SFT)
☆39Aug 22, 2025Updated 7 months ago
Alternatives and similar repositories for VLM-Finetuning
Users that are interested in VLM-Finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Mar 24, 2025Updated last year
- 海思设备上部署阉割版yolov5☆13Nov 22, 2021Updated 4 years ago
- official code for Dynamic Smooth Label Assignment☆11Oct 5, 2022Updated 3 years ago
- Code for "RSF: Optimizing Rigid Scene Flow From 3D Point Clouds Without Labels"☆10Jan 17, 2023Updated 3 years ago
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆12Mar 15, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 在index-tts-vllm的基础上,实现了并提供了模拟流式合成音 频的接口服务及客户端测试脚本☆26Sep 2, 2025Updated 7 months ago
- This repository contains the source code related to the paper Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation☆11Jun 23, 2020Updated 5 years ago
- ☆12May 19, 2024Updated last year
- ☆33Dec 17, 2025Updated 3 months ago
- Simple python interface to be used with crisp_controllers.☆34Apr 2, 2026Updated 2 weeks ago
- The Official PyTorch implementation of "Part Aware Contrastive Learning for Self-Supervised Action Recognition" in IJCAI 2023☆13Nov 9, 2023Updated 2 years ago
- ☆18Jun 14, 2025Updated 10 months ago
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 2 years ago
- ☆34Nov 18, 2025Updated 4 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 增加了indextts2的简单的界面与api调用方式☆27Oct 27, 2025Updated 5 months ago
- Visualize nuScenes sequences on ROS with full tf support.☆21Sep 27, 2023Updated 2 years ago
- Happy Hacking With Claude!!!☆24Oct 27, 2025Updated 5 months ago
- 分类任务的 Focal Loss,PyTorch 实现☆11Jun 13, 2023Updated 2 years ago
- AI驱动的虚拟数字人直播系统,支持2D/3D数字人、TTS、ASR、唇形同步、推流、互动等模块化开发。☆23May 13, 2025Updated 11 months ago
- Nonrigid Iterative Closest Point Algorithm☆10Feb 19, 2016Updated 10 years ago
- 3D_lut generate for surround view☆13Jul 31, 2019Updated 6 years ago
- Registration between 3d volume and 2d images.☆10Dec 21, 2018Updated 7 years ago
- ppt转数字人后台☆19Apr 9, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调☆566Sep 8, 2025Updated 7 months ago
- Object-Region Video Transformers☆24Mar 24, 2022Updated 4 years ago
- 一个开源的多模态 AI 搜索项目,结合 大语言模型(LLM)+ 多源搜索引擎 + 多 Agent 架构,打造新一代的智能问答式搜索体验☆16Mar 26, 2025Updated last year
- Multi-modal 3D ultrasound and CT in image-guided spinal surgery: public database and new registration algorithms☆13Mar 9, 2023Updated 3 years ago
- ☆42Mar 23, 2026Updated 3 weeks ago
- HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model☆93Jul 17, 2025Updated 8 months ago
- ☆12Sep 23, 2022Updated 3 years ago
- [ACL2026 Findings] "Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models"☆20Mar 25, 2025Updated last year
- ocr with yolo3 as feature extractor, implemented by keras, and accelerated by tensorrt☆34Aug 7, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This sample shows how to deploy an industrial computer vision model to detect real world analog pointer meters and extract corresponding …☆12Sep 23, 2022Updated 3 years ago
- ☆15Dec 16, 2021Updated 4 years ago
- Parallelize the serial implementation of 3D scene reconstruction with input from kinect sensor and run it on NvidiaGPU using CUDA.☆12Nov 2, 2016Updated 9 years ago
- Visual SLAM from RGB-D data using Microsoft Kinect☆10May 13, 2016Updated 9 years ago
- 自动生成短视频,文章自动成片,多模态混剪,数字人,声音克隆☆13Jun 25, 2024Updated last year
- This is a project to fuse GPS/IMU/Wheel odometry for the vehicle localization.☆14Aug 19, 2020Updated 5 years ago
- Code for ISBI 2024 paper "Fully Differentiable Correlation-driven 2D/3D Registration for X-Ray to CT Image Fusion"☆10Aug 26, 2024Updated last year