uygarkurt / Fine-Tune-VLMs
☆22Updated 9 months ago
Alternatives and similar repositories for Fine-Tune-VLMs
Users who are interested in Fine-Tune-VLMs are comparing it to the repositories listed below.
- 100 Days of GPU Challenge☆23Updated last month
- Fine-tune Gemma 3 on an object detection task☆87Updated 3 months ago
- ☆55Updated 2 months ago
- Learn the building blocks of how to build gpt-oss from scratch☆91Updated last month
- Building LLMs from scratch following the book from S. Raschka☆31Updated 7 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆11Updated 9 months ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆20Updated 10 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- ☆25Updated 2 months ago
- An agent to generate stunning images :)☆23Updated 5 months ago
- A repo for generating educational presentation videos.☆26Updated 5 months ago
- AI Multi-agent system for real-time, adaptive supply chain coordination and optimization leveraging responsive AI clusters.☆31Updated last year
- Composition of Multimodal Language Models From Scratch☆15Updated last year
- ☆21Updated 11 months ago
- Notebooks for fine-tuning PaliGemma☆117Updated 6 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated last year
- ☆21Updated 9 months ago
- MLFlow End to End Workshop at Chandigarh University☆11Updated 2 years ago
- ☆14Updated last year
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 5 months ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆12Updated last year
- Visual RAG using less than 300 lines of code.☆29Updated last year
- AI Agents using Crew AI☆12Updated last year
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆30Updated 9 months ago
- Starter template for your ML/AI projects (uv package manager, RestAPI with FastAPI and Dockerfile support)☆31Updated 9 months ago
- ☆17Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 8 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆102Updated 10 months ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆34Updated 9 months ago
- Streamlit application that helps users analyze RFPs using the latest Gemini 2.0 Flash Experimental LLM.☆18Updated 10 months ago