uygarkurt / Fine-Tune-VLMsLinks

☆19

Alternatives and similar repositories for Fine-Tune-VLMs

Users that are interested in Fine-Tune-VLMs are comparing it to the libraries listed below

Sorting:

ariG23498 / gemma3-object-detection
Fine tune Gemma 3 on an object detection task
☆74Updated 3 weeks ago
adithya-s-k / YoloGemma
Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…
☆82Updated last year
Sayandip170900 / CUDA-Challenge
100 Days of GPU Challenge
☆21Updated last month
githubpradeep / notebooks
☆54Updated 6 months ago
ritabratamaiti / AnyModal
AnyModal is a Flexible Multimodal Language Model Framework for PyTorch
☆101Updated 7 months ago
ariG23498 / fine-tune-paligemma
Notebooks for fine tuning pali gemma
☆112Updated 3 months ago
anyscale / multimodal-ai
Multimodal AI workloads: batch inference, model training and online serving.
☆22Updated 2 weeks ago
burtenshaw / course_generator
A repo for generating educational presentation videos.
☆25Updated 2 months ago
ashishpatel26 / ai-tutor-rag-system
This is a repository for the course "From Beginner to LLM Developer" by Towards AI.
☆11Updated 7 months ago
Paulescu / plot-generator-agent
Join 15k builders to the Real-World ML Newsletter ⬇️⬇️⬇️
☆48Updated last year
alexander-moore / vlm
Composition of Multimodal Language Models From Scratch
☆15Updated 11 months ago
cityzen95 / LLM_from_scratch
Building LLMs from scratch following the book from S. Raschka
☆31Updated 4 months ago
kyegomez / Finetuning-Suite
Finetune any model on HF in less than 30 seconds
☆57Updated 2 weeks ago
AviSoori1x / seemore
From scratch implementation of a vision language model in pure PyTorch
☆234Updated last year
13331112522 / v-rag
Visual RAG using less than 300 lines of code.
☆28Updated last year
CVxTz / llm-serve-tutorial
☆20Updated last year
andysingal / LLMops
purpose of this repo is to Implement LLMOPs as shared in Deeplearning AI course
☆24Updated last week
ALucek / GRPO-Training
An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆34Updated 2 months ago
fangyuan-ksgk / Mini-LLaVA
A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.
☆94Updated 7 months ago
huggingface / large-scale-image-deduplication
☆123Updated 2 weeks ago
Arshad221b / RAG-on-PDF
Retrieval-Augmented Generation (RAG) over a Large Language Model (LLM) For PDF data extraction
☆27Updated last year
rasbt / RAGs
RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems
☆122Updated 6 months ago
ariG23498 / quantized-diffusion-inference
Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs
☆38Updated 9 months ago
hesamsheikh / AI-Researcher-Agent
☆19Updated last year
SkalskiP / segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…
☆13Updated last year
AntonioGr7 / pratical-llms
A collection of hand on notebook for LLMs practitioner
☆49Updated 6 months ago
wandb / eval-course
☆25Updated 6 months ago
anyscale / e2e-llm-workflows
Fine-tune an LLM to perform batch inference and online serving.
☆112Updated 2 months ago
Nyandwi / deep-computer-vision
Deep Learning for Computer Vision
☆58Updated last year
fabiomatricardi / How-I-Built-a-Chatbot-that-Crushed-ChatGPT
Repo of the code from the Medium article
☆20Updated last year