BTifmmp / paper-piano
Paper Piano uses Python and OpenCV to detect key presses on a hand-drawn piano, translating them into digital notes and sound.
☆33 Updated last month
Related projects:
- Build LLM-powered robots in your garage with MachinaScript For Robots! ☆159 Updated 4 months ago
- ☆22 Updated 3 weeks ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆54 Updated last month
- ☆181 Updated 3 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆77 Updated 3 months ago
- Tiny client for LLMs with vision and tool calling. As simple as it gets. ☆75 Updated last month
- ⚙️ Zero-Shot Autonomous Robots ☆90 Updated 5 months ago
- Eye exploration ☆20 Updated last month
- Each week I create sketches covering key Computer Vision concepts. If you want to learn more about CV, stick around! ☆146 Updated last year
- Documentation for content creation ☆120 Updated this week
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector… ☆171 Updated 2 weeks ago
- Servo hardware files for rx1 humanoid robot ☆74 Updated 2 months ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models. ☆64 Updated 9 months ago
- From-scratch implementation of a vision language model in pure PyTorch ☆149 Updated 4 months ago
- Code Implementation for paper NARRATE ☆35 Updated last month
- Everything you need to know about Transformers! 🤖 ☆127 Updated 10 months ago
- A tiny vectorstore implementation built with NumPy. ☆50 Updated 4 months ago
- Simple and unified interface to zero-shot computer vision models curated for robotics use cases. ☆85 Updated last week
- Computer Vision and Machine Learning Jupyter Notebooks for Educational Purposes ☆74 Updated 6 months ago
- SafeDriveVision is a computer vision project aimed at enhancing road safety. This project leverages deep learning models to detect and al… ☆61 Updated last week
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V… ☆39 Updated last week
- ☆62 Updated 2 months ago
- A 100-day challenge of reading and implementing computer vision concepts using popular Python libraries like OpenCV and Keras. ☆19 Updated 3 months ago
- Quick exploration into fine-tuning Florence 2 ☆250 Updated last month
- GRDN.AI app for garden optimization ☆68 Updated 7 months ago
- Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models ☆78 Updated this week
- Vehicle speed estimation using YOLOv8 ☆28 Updated 5 months ago
- Maybe the new state-of-the-art vision model? We'll see 🤷‍♂️ ☆154 Updated 8 months ago
- Run PaliGemma in real time ☆122 Updated 4 months ago
- Salient feature extractor based on YOLOv8 ☆71 Updated last year