A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM 3, and Qwen3-VL.
★ 9,206 · Feb 19, 2026 · Updated last week
Alternatives and similar repositories for notebooks
Users who are interested in notebooks are comparing it to the libraries listed below.
- We write your reusable computer vision tools. (★ 36,543 · Updated this week)
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL (★ 2,659 · Updated this week)
- This repository contains demos I made with the Transformers library by HuggingFace. (★ 11,506 · Jan 13, 2026 · Updated last month)
- Images to inference with no labeling (use foundation models to train supervised models). (★ 2,634 · May 14, 2025 · Updated 9 months ago)
- Turn any computer or edge device into a command center for your computer vision projects. (★ 2,202 · Updated this week)
- Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS. (★ 5,007 · Updated this week)
- This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models. (★ 1,158 · Jan 23, 2025 · Updated last year)
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. (★ 75,460 · Feb 5, 2026 · Updated 3 weeks ago)
- Ultralytics YOLO (★ 53,508 · Updated this week)
- This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI) (★ 6,401 · Apr 22, 2024 · Updated last year)
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. (★ 13,182 · Feb 22, 2026 · Updated last week)
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi… (★ 53,497 · Sep 18, 2024 · Updated last year)
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step (★ 86,149 · Feb 19, 2026 · Updated last week)
- A ranked list of awesome machine learning Python libraries. Updated weekly. (★ 23,250 · Updated this week)
- Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API (★ 1,683 · Jan 14, 2025 · Updated last year)
- Discover the latest machine learning / AI courses on YouTube. (★ 17,101 · Jan 22, 2024 · Updated 2 years ago)
- Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and … (★ 17,409 · Sep 5, 2024 · Updated last year)
- Fine-tuning & Reinforcement Learning for LLMs. Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. (★ 52,724 · Updated this week)
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. (★ 24,478 · Aug 12, 2024 · Updated last year)
- ★ 8,653 · Sep 22, 2024 · Updated last year
- [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection" (★ 9,760 · Aug 12, 2024 · Updated last year)
- Collection of Kaggle Solutions and Ideas (★ 6,334 · Feb 5, 2026 · Updated 3 weeks ago)
- Free MLOps course from DataTalks.Club (★ 14,259 · Dec 1, 2025 · Updated 3 months ago)
- Machine Learning Engineering Open Book (★ 17,162 · Feb 21, 2026 · Updated last week)
- Explanation to key concepts in ML (★ 8,530 · Jun 30, 2025 · Updated 8 months ago)
- PyTorch code and models for the DINOv2 self-supervised learning method. (★ 12,427 · Updated this week)
- Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data … (★ 11,333 · Jan 13, 2026 · Updated last month)
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… (★ 18,220 · Nov 3, 2025 · Updated 3 months ago)
- [CVPR 2024] Real-Time Open-Vocabulary Object Detection (★ 6,217 · Feb 26, 2025 · Updated last year)
- The programming language for agentic software. Build, run, and manage multi-agent systems at scale. (★ 38,104 · Updated this week)
- Recipes for shrinking, optimizing, customizing cutting edge vision models. (★ 1,883 · Jan 9, 2026 · Updated last month)
- [ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for… (★ 5,740 · Updated this week)
- Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG. (★ 23,218 · Oct 28, 2025 · Updated 4 months ago)
- This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information r… (★ 25,581 · Feb 17, 2026 · Updated last week)
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models" (★ 23,021 · Dec 17, 2025 · Updated 2 months ago)
- LLM Finetuning with peft (★ 2,818 · Aug 1, 2025 · Updated 7 months ago)
- DSPy: The framework for programming, not prompting, language models (★ 32,381 · Updated this week)
- A course on aligning smol models. (★ 6,587 · Feb 6, 2026 · Updated 3 weeks ago)
- OCR, layout analysis, reading order, table recognition in 90+ languages (★ 19,360 · Updated this week)