deltacv / PaperVisionLinks
Create your custom OpenCV algorithms using a user-friendly node editor interface, inspired by Blender and Unreal Engine blueprints! Quickly prototype your vision as you edit.
☆377Updated last week
Alternatives and similar repositories for PaperVision
Users that are interested in PaperVision are comparing it to the libraries listed below
Sorting:
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms☆2,132Updated this week
- This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on …☆349Updated 11 months ago
- Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)☆772Updated 5 months ago
- Open-source autonomous cleaning & housekeeping robot☆235Updated last month
- Open Source AI Math Notes☆492Updated last year
- An AI agent to control drones from your CLI☆130Updated last month
- RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO and designed for fine-tuning.☆2,993Updated last week
- ☆244Updated 2 months ago
- 🧙♂️ Writing by manipulating visual representations of stories☆818Updated 2 months ago
- ☆77Updated 5 months ago
- Base yolov11n.pt trained on 6877 images of Drones and UAVs☆171Updated last month
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆400Updated last month
- EyeTrax – webcam-based eye tracking made simple☆181Updated 3 months ago
- Transform PDFs into AI podcasts for engaging on-the-go audio content.☆741Updated 3 months ago
- podcastfy.ai gradio demo app☆335Updated 9 months ago
- Using the moondream VLM with optical flow for promptable object tracking☆71Updated 6 months ago
- Self-hosted voice chat with LLMs☆461Updated 6 months ago
- Paper Piano uses Python and OpenCV to detect key presses on a hand-drawn piano, translating them into digital notes and sound.☆43Updated last year
- ☆104Updated this week
- https://no-ocr.com/about☆164Updated 2 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆129Updated 2 weeks ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆67Updated last month
- Turn local files into a prompt for an LLM☆176Updated 8 months ago
- [AAAI 2025] Event-Enhanced Blurry Video Super-Resolution☆438Updated 5 months ago
- Realtime Voice and Vision wtih Brilliant Labs Frame and Gemini☆63Updated 4 months ago
- Interesting physics-sims generated via LLM prompting.☆268Updated 4 months ago
- AI-Powered Video Retrieval & Clipping Tool☆337Updated 3 weeks ago
- ☆168Updated 10 months ago
- KeyForge3D is an app that turns a photo of a key into a 3D-printable STL file. Ideal for locksmiths and hobbyists, it analyzes the key's …☆229Updated 5 months ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆213Updated 10 months ago