q-viper / image-bakerLinks
Let's bake an image.
☆15Updated last month
Alternatives and similar repositories for image-baker
Users that are interested in image-baker are comparing it to the libraries listed below
Sorting:
- Fine tune Gemma 3 on an object detection task☆84Updated 2 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆161Updated last month
- Tools for merging pretrained large language models.☆19Updated last year
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆89Updated last week
- Notebooks for fine tuning pali gemma☆117Updated 5 months ago
- Eye exploration☆28Updated 7 months ago
- Build Agentic workflows with function calling using open LLMs☆28Updated 3 weeks ago
- code for training and using chess embeddings models☆12Updated last year
- Create topological graph for image segments.☆22Updated 11 months ago
- Official code for PEEKABOO2: Adapting Peekaboo with Segment Anything Model for Unsupervised Object Localization in Images and Videos.☆27Updated 2 weeks ago
- Practical Python exercises on classical computer vision and clean engineering practices☆22Updated 4 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆67Updated last year
- Lightweight, open-source, high-performance Yolo implementation☆43Updated 3 months ago
- A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, a…☆33Updated 2 weeks ago
- Solving Computer Vision with AI agents☆33Updated 2 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆31Updated last year
- ☆20Updated last year
- Vehicle speed estimation using YOLOv8☆30Updated last year
- An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.☆29Updated 6 months ago
- Using the moondream VLM with optical flow for promptable object tracking☆71Updated 6 months ago
- Take your LLM to the optometrist.☆40Updated last month
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆11Updated 8 months ago
- ☆21Updated 7 months ago
- Experiment and integrate with different OCR frameworks seamlessly☆103Updated last year
- ☆13Updated last year
- Building LLMs from scratch following the book from S. Raschka☆31Updated 5 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆36Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 6 months ago
- Open source Deep Learning Containers (DLCs) are a set of Docker images for training and serving models in PyTorch, OpenCV (compiled for G…☆86Updated last year