q-viper / image-bakerLinks
Let's bake an image.
β16Updated last month
Alternatives and similar repositories for image-baker
Users that are interested in image-baker are comparing it to the libraries listed below
Sorting:
- Inference and fine-tuning examples for vision models from π€ Transformersβ165Updated 5 months ago
- Official code for PEEKABOO2: Adapting Peekaboo with Segment Anything Model for Unsupervised Object Localization in Images and Videos.β29Updated last month
- Fine tune Gemma 3 on an object detection taskβ96Updated 6 months ago
- Practical Python exercises on classical computer vision and clean engineering practicesβ25Updated 9 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.β97Updated last week
- Inference, Fine Tuning and many more recipes with Gemma family of modelsβ279Updated 6 months ago
- Create topological graph for image segments.β23Updated last year
- Solving Computer Vision with AI agentsβ35Updated 6 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β85Updated last year
- A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, aβ¦β44Updated 4 months ago
- Notebooks for fine tuning pali gemmaβ117Updated 9 months ago
- Open source Deep Learning Containers (DLCs) are a set of Docker images for training and serving models in PyTorch, OpenCV (compiled for Gβ¦β87Updated 2 years ago
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LLβ¦β21Updated last year
- Experiment and integrate with different OCR frameworks seamlesslyβ102Updated last year
- 100 Days of GPU Challengeβ25Updated 2 months ago
- Compare Savant and PyTorch performanceβ13Updated last year
- Using the moondream VLM with optical flow for promptable object trackingβ73Updated 11 months ago
- Vision Transformers for image classification, image segmentation, and object detection.β63Updated 3 months ago
- Join 15k builders to the Real-World ML Newsletter β¬ οΈβ¬οΈβ¬οΈβ47Updated last year
- Eye explorationβ31Updated 2 months ago
- Build Agentic workflows with function calling using open LLMsβ28Updated 3 weeks ago
- β96Updated 2 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.β34Updated 2 years ago
- β20Updated last year
- Automatic Thief Detection via CCTV with Alarm System and Perpetrator Image Capture using YOLOv5 + ROI. This project utilizes computer visβ¦β14Updated last year
- Survey: A collection of AWESOME papers and resources on the latest research in Object Tracking.β23Updated 2 months ago
- β59Updated last year
- Luth is a state-of-the-art series of fine-tuned LLMs for Frenchβ41Updated 3 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.β69Updated last year
- A truly open version of gpt-oss which shows the entire pre-training from scratchβ85Updated 4 months ago