deltacv / PaperVision
Create your custom OpenCV algorithms using a user-friendly node editor interface, inspired by Blender and Unreal Engine blueprints! Quickly prototype your vision as you edit.
☆382 · Updated last month
Alternatives and similar repositories for PaperVision
Users who are interested in PaperVision are comparing it to the libraries listed below.
- Open Source AI Math Notes ☆498 · Updated last year
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms ☆2,199 · Updated 2 weeks ago
- This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on … ☆348 · Updated last year
- Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight) ☆809 · Updated 8 months ago
- Open-source autonomous cleaning & housekeeping robot ☆243 · Updated 5 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper. ☆141 · Updated 4 months ago
- Mission intent compiler and autonomy supervisor for unmanned systems. ☆144 · Updated 3 weeks ago
- Interesting physics-sims generated via LLM prompting. ☆269 · Updated 7 months ago
- Realtime Voice and Vision with Brilliant Labs Frame and Gemini ☆68 · Updated 7 months ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr… ☆220 · Updated last year
- [AAAI 2025] Event-Enhanced Blurry Video Super-Resolution ☆450 · Updated last month
- Local Video-LLM powered AI Baby Monitor ☆458 · Updated 7 months ago
- Self-hosted voice chat with LLMs ☆463 · Updated 10 months ago
- podcastfy.ai gradio demo app ☆334 · Updated last year
- ComfyUI wrapper for Moondream's gaze detection ☆55 · Updated 11 months ago
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model" ☆847 · Updated last month
- An opinionated list of awesome Ollama web and desktop uis, frameworks, libraries, software and resources. ☆438 · Updated 11 months ago
- Transform PDFs into AI podcasts for engaging on-the-go audio content. ☆785 · Updated this week
- ☆209 · Updated 11 months ago
- https://no-ocr.com/about ☆175 · Updated 6 months ago
- ☆80 · Updated 8 months ago
- Create and control 3D shapes using hand gestures in real-time. Built with mediapipe computer vision and threejs ☆203 · Updated 6 months ago
- Excalidraw meets ComfyUI for LLMs ☆306 · Updated 4 months ago
- Using the moondream VLM with optical flow for promptable object tracking ☆72 · Updated 10 months ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,… ☆415 · Updated 5 months ago
- Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart. ☆246 · Updated 5 months ago
- 🧙‍♂️ Writing by manipulating visual representations of stories ☆937 · Updated 5 months ago
- gradio WebUI for AdvancedLivePortrait ☆525 · Updated 9 months ago
- Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100% open source ☆318 · Updated 6 months ago
- experiments with different llms ☆37 · Updated last year