q-viper / image-bakerLinks
Let's bake an image.
☆15Updated last week
Alternatives and similar repositories for image-baker
Users that are interested in image-baker are comparing it to the libraries listed below
Sorting:
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆162Updated 5 months ago
- Fine tune Gemma 3 on an object detection task☆95Updated 5 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆95Updated 2 weeks ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆34Updated 2 years ago
- Create topological graph for image segments.☆22Updated last year
- Notebooks for fine tuning pali gemma☆117Updated 8 months ago
- Using the moondream VLM with optical flow for promptable object tracking☆72Updated 10 months ago
- Solving Computer Vision with AI agents☆35Updated 6 months ago
- An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.☆30Updated 10 months ago
- Take your LLM to the optometrist.☆43Updated 3 weeks ago
- Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models☆132Updated 3 weeks ago
- A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, a…☆44Updated 4 months ago
- Eye exploration☆31Updated last month
- 100 Days of GPU Challenge☆24Updated last month
- Vision Transformers for image classification, image segmentation, and object detection.☆63Updated 2 months ago
- ☆34Updated last year
- Practical Python exercises on classical computer vision and clean engineering practices☆24Updated 8 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- code for training and using chess embeddings models☆13Updated last year
- auto_labeler - An all-in-one library to automatically label vision data☆19Updated 11 months ago
- Build Agentic workflows with function calling using open LLMs☆28Updated this week
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated last year
- ☆26Updated 2 years ago
- ☆56Updated last year
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆32Updated last year
- From scratch implementation of a vision language model in pure PyTorch☆252Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆68Updated last year
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆41Updated 2 months ago
- Join 15k builders to the Real-World ML Newsletter ⬇️⬇️⬇️☆47Updated last year