deltacv / PaperVisionLinks
Create your custom OpenCV algorithms using a user-friendly node editor interface, inspired by Blender and Unreal Engine blueprints! Quickly prototype your vision as you edit.
☆377Updated last month
Alternatives and similar repositories for PaperVision
Users that are interested in PaperVision are comparing it to the libraries listed below
Sorting:
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms☆2,156Updated last week
- Interesting physics-sims generated via LLM prompting.☆269Updated 6 months ago
- Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)☆795Updated 6 months ago
- Open Source AI Math Notes☆496Updated last year
- https://no-ocr.com/about☆167Updated 4 months ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆406Updated 3 months ago
- This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on …☆347Updated last year
- Open-source autonomous cleaning & housekeeping robot☆237Updated 3 months ago
- An AI agent to control drones from your CLI☆137Updated 3 months ago
- ☆248Updated 4 months ago
- Self-hosted voice chat with LLMs☆463Updated 8 months ago
- ☆80Updated 7 months ago
- Using the moondream VLM with optical flow for promptable object tracking☆71Updated 8 months ago
- A python script designed to translate large amounts of text with an LLM and the Ollama API☆420Updated this week
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆135Updated 2 months ago
- 🧙♂️ Writing by manipulating visual representations of stories☆915Updated 4 months ago
- AI-Powered Video Retrieval & Clipping Tool☆358Updated 2 months ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆236Updated 10 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆210Updated last month
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆770Updated this week
- gradio WebUI for AdvancedLivePortrait☆516Updated 8 months ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆217Updated last year
- ☆1,012Updated 3 weeks ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆69Updated 3 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆289Updated 10 months ago
- podcastfy.ai gradio demo app☆334Updated 11 months ago
- Realtime Voice and Vision wtih Brilliant Labs Frame and Gemini☆66Updated 6 months ago
- ☆468Updated this week
- Turn local files into a prompt for an LLM☆177Updated 9 months ago
- ☆107Updated last month