deltacv / PaperVisionLinks
Create your custom OpenCV algorithms using a user-friendly node editor interface, inspired by Blender and Unreal Engine blueprints! Quickly prototype your vision as you edit.
☆386Updated this week
Alternatives and similar repositories for PaperVision
Users that are interested in PaperVision are comparing it to the libraries listed below
Sorting:
- Open-source autonomous cleaning & housekeeping robot☆251Updated 6 months ago
- Open Source AI Math Notes☆501Updated last year
- Trackers gives you clean, modular re-implementations of leading multi-object tracking algorithms released under the permissive Apache 2.0…☆2,389Updated this week
- This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on …☆349Updated last year
- Interesting physics-sims generated via LLM prompting.☆268Updated 8 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆143Updated 5 months ago
- Local Video-LLM powered AI Baby Monitor☆480Updated 8 months ago
- podcastfy.ai gradio demo app☆333Updated last year
- Using the moondream VLM with optical flow for promptable object tracking☆73Updated 11 months ago
- Base yolov11n.pt trained on 6877 images of Drones and UAVs☆227Updated 5 months ago
- Control drones with natural language☆167Updated 2 weeks ago
- Build computer vision models in a fraction of the time and with less data.☆443Updated this week
- [AAAI 2025] Event-Enhanced Blurry Video Super-Resolution☆453Updated 2 months ago
- Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.☆252Updated 6 months ago
- Transform PDFs into AI podcasts for engaging on-the-go audio content.☆790Updated last week
- Self-hosted voice chat with LLMs☆461Updated 11 months ago
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆868Updated last week
- ☆249Updated 7 months ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆415Updated 6 months ago
- Real-time pose estimation pipeline with 🤗 Transformers☆66Updated 11 months ago
- Translate full-length books and documents with Ollama, OpenAI (comptabible), Gemini, Mistral, Poe or OpenRouter. Preserves formatting. Re…☆481Updated this week
- Hands-On Learning in Computer Vision☆207Updated this week
- Paper Piano uses Python and OpenCV to detect key presses on a hand-drawn piano, translating them into digital notes and sound.☆43Updated last year
- AI-powered assistant to help you with your daily tasks, powered by Llama 3, DeepSeek R1, and many more models on HuggingFace.☆528Updated 2 months ago
- EyeTrax – webcam-based eye tracking made simple☆233Updated 4 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆212Updated 3 months ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆220Updated last year
- gradio WebUI for AdvancedLivePortrait☆525Updated 10 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆298Updated last year
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆239Updated last month