BTifmmp / paper-pianoLinks
Paper Piano uses Python and OpenCV to detect key presses on a hand-drawn piano, translating them into digital notes and sound.
β43Updated last year
Alternatives and similar repositories for paper-piano
Users that are interested in paper-piano are comparing it to the libraries listed below
Sorting:
- Inference and fine-tuning examples for vision models from π€ Transformersβ161Updated 3 weeks ago
- EyeTrax β webcam-based eye tracking made simpleβ180Updated 3 months ago
- β113Updated 9 months ago
- Official code for PEEKABOO2: Adapting Peekaboo with Segment Anything Model for Unsupervised Object Localization in Images and Videos.β21Updated this week
- Using the moondream VLM with optical flow for promptable object trackingβ70Updated 6 months ago
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vβ¦β124Updated 2 months ago
- β113Updated 2 months ago
- Build LLM-powered robots in your garage with MachinaScript For Robots!β189Updated 11 months ago
- β65Updated 6 months ago
- An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.β29Updated 6 months ago
- Eye explorationβ28Updated 6 months ago
- β75Updated 3 months ago
- β98Updated 2 months ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementaβ¦β227Updated 8 months ago
- Mapping ping with a simple script and Ordinary Kriging to interpolate sparse measurements into a nice visualization!β79Updated 10 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of modelsβ267Updated last month
- From scratch implementation of a vision language model in pure PyTorchβ239Updated last year
- Ultralytics Notebooks πβ104Updated last week
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorchβ103Updated 8 months ago
- βοΈ Zero-Shot Autonomous Robotsβ116Updated last year
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scripβ¦β61Updated last year
- No longer maintained:Your personal ArXiv Curatorβ40Updated 9 months ago
- Retrieval-augmented generation (RAG) for remote & local LLM useβ45Updated 3 months ago
- Practical Python exercises on classical computer vision and clean engineering practicesβ21Updated 4 months ago
- YOLOv10: Real-Time End-to-End Object Detectionβ11Updated last year
- Computer Vision and Machine Learning Jupyter Notebooks for Educational Purposesβ77Updated 8 months ago
- Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.β214Updated last month
- A RAG system designed to process documents with multimodal content. It can generate factual, context-aware answers to user queries, basedβ¦β25Updated 8 months ago
- A Demo of Cache-Augmented Generation (CAG) in an LLMβ106Updated 2 months ago
- β51Updated last month