q-viper / image-bakerLinks
Let's bake an image.
☆14Updated 2 weeks ago
Alternatives and similar repositories for image-baker
Users that are interested in image-baker are comparing it to the libraries listed below
Sorting:
- Fine tune Gemma 3 on an object detection task☆78Updated last month
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆87Updated last week
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆161Updated 3 weeks ago
- code for training and using chess embeddings models☆12Updated last year
- Create topological graph for image segments.☆22Updated 11 months ago
- Official code for PEEKABOO2: Adapting Peekaboo with Segment Anything Model for Unsupervised Object Localization in Images and Videos.☆21Updated this week
- Notebooks for fine tuning pali gemma☆114Updated 4 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆31Updated last year
- Build Agentic workflows with function calling using open LLMs☆28Updated 3 weeks ago
- This repository contains the source code for the Saving 77% of the Parameters in Large Language Models Technical Report☆30Updated this week
- Tools for merging pretrained large language models.☆19Updated last year
- 100 Days of GPU Challenge☆21Updated 2 months ago
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…☆21Updated 10 months ago
- Lightweight, open-source, high-performance Yolo implementation☆42Updated 3 months ago
- Notebooks using the Neural Magic libraries 📓☆39Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 6 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆29Updated last week
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 8 months ago
- Building LLMs from scratch following the book from S. Raschka☆31Updated 5 months ago
- From scratch implementation of a vision language model in pure PyTorch☆239Updated last year
- A high-performance library for detecting objects in images and videos, leveraging Rust's speed and safety. Optionally supports a gRPC API…☆32Updated 4 months ago
- Practical Python exercises on classical computer vision and clean engineering practices☆21Updated 4 months ago
- This is a courseware for DataScienceWithPython☆15Updated 2 years ago
- Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models☆123Updated this week
- ☆49Updated 6 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated 8 months ago
- Using the moondream VLM with optical flow for promptable object tracking☆70Updated 6 months ago
- auto_labeler - An all-in-one library to automatically label vision data☆16Updated 7 months ago
- ☆128Updated last month