deltacv / PaperVision
Create your custom OpenCV algorithms using a user-friendly node editor interface, inspired by Blender and Unreal Engine blueprints! Quickly prototype your vision as you edit.
☆386 Updated last week
Alternatives and similar repositories for PaperVision
Users who are interested in PaperVision are comparing it to the libraries listed below.
- Trackers gives you clean, modular re-implementations of leading multi-object tracking algorithms released under the permissive Apache 2.0… ☆2,389 Updated this week
- Open Source AI Math Notes ☆501 Updated last year
- Control drones with natural language ☆167 Updated 2 weeks ago
- Interesting physics-sims generated via LLM prompting. ☆268 Updated 8 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper. ☆143 Updated 5 months ago
- Open-source autonomous cleaning & housekeeping robot ☆251 Updated 6 months ago
- Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart. ☆252 Updated 6 months ago
- This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on … ☆349 Updated last year
- Local Video-LLM powered AI Baby Monitor ☆480 Updated 8 months ago
- https://no-ocr.com/about ☆176 Updated 7 months ago
- ☆209 Updated last year
- Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight) ☆819 Updated 9 months ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,… ☆415 Updated 6 months ago
- podcastfy.ai gradio demo app ☆333 Updated last year
- Self-hosted voice chat with LLMs ☆461 Updated 11 months ago
- ☆79 Updated 9 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts` ☆212 Updated 3 months ago
- ☆947 Updated 7 months ago
- Translate full-length books and documents with Ollama, OpenAI (compatible), Gemini, Mistral, Poe or OpenRouter. Preserves formatting. Re… ☆481 Updated this week
- Transform PDFs into AI podcasts for engaging on-the-go audio content. ☆790 Updated last week
- Realtime Voice and Vision with Brilliant Labs Frame and Gemini ☆68 Updated 8 months ago
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model" ☆868 Updated last week
- gradio WebUI for AdvancedLivePortrait ☆525 Updated 10 months ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr… ☆221 Updated last year
- ☆107 Updated 2 weeks ago
- Using the moondream VLM with optical flow for promptable object tracking ☆73 Updated 11 months ago
- Turn local files into a prompt for an LLM ☆177 Updated last year
- A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model. ☆361 Updated last year
- ☆11 Updated last year
- AI-powered assistant to help you with your daily tasks, powered by Llama 3, DeepSeek R1, and many more models on HuggingFace. ☆528 Updated 2 months ago