q-viper / image-bakerLinks
Let's bake an image.
☆15Updated last week
Alternatives and similar repositories for image-baker
Users that are interested in image-baker are comparing it to the libraries listed below
Sorting:
- Fine tune Gemma 3 on an object detection task☆92Updated 5 months ago
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆162Updated 4 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- Create topological graph for image segments.☆22Updated last year
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated 11 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆33Updated last year
- code for training and using chess embeddings models☆13Updated last year
- Official code for PEEKABOO2: Adapting Peekaboo with Segment Anything Model for Unsupervised Object Localization in Images and Videos.☆29Updated last month
- Tools for merging pretrained large language models.☆19Updated last year
- Using the moondream VLM with optical flow for promptable object tracking☆71Updated 9 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆93Updated last week
- 100 Days of GPU Challenge☆24Updated last month
- Notebooks for fine tuning pali gemma☆117Updated 8 months ago
- Practical Python exercises on classical computer vision and clean engineering practices☆22Updated 7 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆40Updated 2 months ago
- Join 15k builders to the Real-World ML Newsletter ⬇️⬇️⬇️☆47Updated last year
- Train LLM on Hugging Face infra☆67Updated last month
- A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, a…☆43Updated 3 months ago
- Build Agentic workflows with function calling using open LLMs☆28Updated 2 weeks ago
- Solving Computer Vision with AI agents☆34Updated 5 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆88Updated 2 years ago
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆79Updated 3 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆68Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think☆71Updated this week
- An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.☆29Updated 9 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- Building LLMs from scratch following the book from S. Raschka☆32Updated 8 months ago
- auto_labeler - An all-in-one library to automatically label vision data☆19Updated 11 months ago
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆32Updated 11 months ago
- Ultralytics Notebooks 🚀☆168Updated last month