7Alive7/VLM-Finetuning

A from-scratch (0 to 1) VLM finetune implementation that does not rely on any framework (covers both Pre-train and SFT)

☆39

Alternatives and similar repositories for VLM-Finetuning

Users who are interested in VLM-Finetuning are comparing it to the libraries listed below.

YingXiuHe / u2net-pytorch
Train on your own dataset using a custom dataset and training code modeled on u2net (basic version)
☆12 Apr 20, 2022 Updated 4 years ago
ShaohonChen / Qwen3-SmVL
Splices the vision head of SmolVLM2 onto the Qwen3-0.6B model and fine-tunes the combination
☆602 Sep 8, 2025 Updated 10 months ago
alibaba-damo-academy / VL-Cogito
☆24 Nov 4, 2025 Updated 8 months ago
inkyusa / se2-loftr
☆18 May 28, 2022 Updated 4 years ago
storyandwine / CourseSelectionWeapp
A simple course-selection system built on WeChat Mini Program cloud development, with a front end using the vant weapp framework
☆14 Jul 11, 2019 Updated 7 years ago
CabbageWust / Image_Classify_pytorch
A simple image classification project: training and prediction in PyTorch, with inference through the PyTorch C++ API and TensorRT
☆18 Jun 15, 2020 Updated 6 years ago
tengwang0318 / hierarchial_reward_model
[ACL2026 Findings] "Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models"
☆20 Mar 25, 2025 Updated last year
xt4d / SparseGNV
SparseGNV: Generating Novel Views of Indoor Scenes with Sparse Input Views
☆19 Feb 27, 2024 Updated 2 years ago
libing64 / Qwen2.5-VL-Fine-Tuning
☆34 Mar 2, 2025 Updated last year
Hsdxm / hisi-yolov5
Deploying a stripped-down version of yolov5 on HiSilicon devices
☆13 Nov 22, 2021 Updated 4 years ago
mever-team / SAGI
Official code repository for the paper A Large-scale AI-generated Image Inpainting Benchmark
☆17 Jan 13, 2026 Updated 6 months ago
WorldEditor50 / v4l2camera
☆16 Mar 24, 2025 Updated last year
paramaggarwal / CarND-Traffic-Sign-Classifier-Project
Classify Traffic Signs.
☆10 Jan 31, 2017 Updated 9 years ago
DebeshJha / TransNetR
Official implementation of TransNetR: Transformer-based Residual Network for Polyp Segmentation with Multi-Center Out-of-Distribution Tes…
☆26 Feb 23, 2024 Updated 2 years ago
newintelligence4 / BEVfusion_preprocess
Multi-LiDAR preprocessor for BEVfusion
☆11 Aug 25, 2023 Updated 2 years ago
buaa-colalab / VGGT-S
[CVPR'26 Oral] VGGT-Segmentor: Geometry-Enhanced Cross-View Segmentation
☆21 May 23, 2026 Updated 2 months ago
davezdeng8 / rsf
Code for "RSF: Optimizing Rigid Scene Flow From 3D Point Clouds Without Labels"
☆10 Jan 17, 2023 Updated 3 years ago
YonghaoHe / DSLA
Official code for Dynamic Smooth Label Assignment
☆12 Oct 5, 2022 Updated 3 years ago
dengyanbo / PointCloud_RoadBoundaryDetection
☆15 May 6, 2018 Updated 8 years ago
RunpeiDong / DGMS
[ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks
☆11 May 21, 2023 Updated 3 years ago
yunhao-tech / Course_project
☆16 Apr 8, 2023 Updated 3 years ago
NNU-GISA / Lane-Detection-from-Point-Cloud
Detect lanes from the point cloud of an 80-meter highway road
☆18 Jun 17, 2019 Updated 7 years ago
engcang / utility_codes
A collection of various utility code I wrote myself
☆13 Oct 10, 2025 Updated 9 months ago
wangx1996 / Lidar-pcd-2-jpg-of-bird-eye-view-
Generate a bird's-eye-view JPG file from a PCD file.
☆11 Apr 29, 2019 Updated 7 years ago
lxl24 / SwinTransformerV2_TensorRT
For 2022 Nvidia Hackathon
☆22 Jun 28, 2022 Updated 4 years ago
CXR-AL14 / CXR-Code
☆12 Sep 23, 2022 Updated 3 years ago
HiLab-git / DCA-Net
☆12 May 19, 2024 Updated 2 years ago
B1ANKC-MOV / SpringVue
Companion code for SpringVue full-stack learning · entirely hand-written · not complete
☆16 Jan 16, 2024 Updated 2 years ago
FudanOCR / FudanOCR
FudanOCR: A modularized and extensible OCR framework for text detection and recognition. The model group contains CRNN, MORAN, EAST and s…
☆11 Dec 8, 2022 Updated 3 years ago
snmnmin12 / VO-with-Loop-Clousre-Detector
Loop Closure Detector
☆13 Feb 2, 2018 Updated 8 years ago
kitsch231 / pytorch_fake_news_Classification_mml
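A multimodal classification task implemented in PyTorch. The text and image branches use BERT and ResNet respectively for feature extraction (multiple models can be combined in the config). It achieved good performance on my small dataset (validation accuracy 96%).
☆82 Mar 25, 2023 Updated 3 years ago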
lemon-little / BetterSynth
Tianchi Better Synth multimodal large model data synthesis challenge: a solution that counts as a success if it beats the baseline
☆30 Oct 30, 2025 Updated 8 months ago
JiaxiongQ / GauSim
A real-world autonomous driving simulator based on 3D Gaussian Splatting for scene augmentation
☆16 Jun 10, 2024 Updated 2 years ago
GitHubOfHyl97 / SkeAttnCLR
The Official PyTorch implementation of "Part Aware Contrastive Learning for Self-Supervised Action Recognition" in IJCAI 2023
☆13 Nov 9, 2023 Updated 2 years ago
Martlgap / x-face-verification
Repo for our Paper: Explainable Model-Agnostic Similarity and Confidence in Face Verification
☆18 Sep 28, 2023 Updated 2 years ago
Yachao-Zhang / PSD
Perturbed Self-Distillation: Weakly Supervised Large-Scale Point Cloud Semantic Segmentation (ICCV2021)
☆21 Dec 12, 2022 Updated 3 years ago
yujunhuics / Reyes
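2025.01: Implemented a multimodal large model from scratch, named Reyes (睿视; R stands for 睿, "wise", and eyes for 眼, "eye"). Reyes has 8B parameters; the vision encoder is InternViT-300M-448px-V2_5 and the language model side uses Qwen2.5-7B-Instruct. Reyes also uses a two…
☆34 Feb 10, 2026 Updated 5 months ago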
ICTMCG / GRE
Generative Regional Editing (GRE) Benchmark
☆20 Sep 10, 2024 Updated last year
tcmyxc / FocalLoss
Focal Loss for classification tasks, implemented in PyTorch
☆10 Jun 13, 2023 Updated 3 years ago