deltacv / PaperVisionLinks
Create your custom OpenCV algorithms using a user-friendly node editor interface, inspired by Blender and Unreal Engine blueprints! Quickly prototype your vision using live previews as you edit.
☆371Updated 2 months ago
Alternatives and similar repositories for PaperVision
Users that are interested in PaperVision are comparing it to the libraries listed below
Sorting:
- Open Source AI Math Notes☆491Updated last year
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms☆1,851Updated this week
- This repository contains the code for a virtual try-on application built using Flask, Twilio's WhatsApp API, and Gradio's virtual try-on …☆346Updated 8 months ago
- [AAAI 2025] Event-Enhanced Blurry Video Super-Resolution☆415Updated 2 months ago
- Securade.ai HUB - A generative AI based edge platform for computer vision that connects to existing CCTV cameras and makes them smart.☆203Updated last week
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆387Updated last month
- Interesting physics-sims generated via LLM prompting.☆263Updated 2 months ago
- RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.☆2,331Updated this week
- An AI agent to control drones☆115Updated this week
- Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)☆737Updated 2 months ago
- About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]☆675Updated last month
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆121Updated 9 months ago
- https://no-ocr.com/about☆161Updated 2 weeks ago
- podcastfy.ai gradio demo app☆334Updated 7 months ago
- ☆171Updated 11 months ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆208Updated 8 months ago
- Excalidraw meets ComfyUI for LLMs☆268Updated this week
- ☆260Updated last week
- Computer use SDK for building agents that learn from human screen recordings. Accessibility-first. Cross-platform (Windows/macOS/Linux), …☆703Updated last week
- Self-hosted voice chat with LLMs☆432Updated 4 months ago
- ☆77Updated 3 months ago
- Realtime Voice and Vision wtih Brilliant Labs Frame and Gemini☆58Updated 2 months ago
- ☆206Updated 5 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆208Updated 6 months ago
- An opinionated list of awesome Ollama web and desktop uis, frameworks, libraries, software and resources.☆391Updated 5 months ago
- KeyForge3D is an app that turns a photo of a key into a 3D-printable STL file. Ideal for locksmiths and hobbyists, it analyzes the key's …☆119Updated 3 months ago
- Transform PDFs into AI podcasts for engaging on-the-go audio content.☆684Updated last month
- AI-powered assistant to help you with your daily tasks, powered by Llama 3, DeepSeek R1, and many more models on HuggingFace.☆506Updated 4 months ago
- Voice-Enabled Math Tutor Powered by Groq that Calculates and Renders Live Problems and Instruction with LaTeX in Seconds!☆228Updated 6 months ago
- HawkinsDB is our take on giving AI systems a more human-like way to store and recall information, inspired by how our own brains work. Ba…☆299Updated 6 months ago