Code for the paper Proactive Hearing Assistants that Isolate Egocentric Conversations
☆43Nov 19, 2025Updated 4 months ago
Alternatives and similar repositories for proactive_hearing_assistant
Users that are interested in proactive_hearing_assistant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- End2End Virtual Try-on with Visual Reference, CVPR2026☆61Mar 29, 2026Updated 2 weeks ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆41Mar 24, 2026Updated 3 weeks ago
- Echo-TTS inference codebase☆165Dec 5, 2025Updated 4 months ago
- [CVPR 2026] When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models☆52Updated this week
- Work in progress rust bindings to ggml☆12May 1, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- NPU powered On-device AI Mobile applications using Melange☆53Mar 18, 2026Updated 3 weeks ago
- A library for consuming evdev-capable devices☆13Apr 13, 2022Updated 4 years ago
- ☆15Dec 8, 2022Updated 3 years ago
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech☆16Sep 20, 2024Updated last year
- Text-to-text alignment algorithm for speech recognition error analysis.☆28Apr 6, 2026Updated last week
- superfast text to speech in any voice☆62Feb 16, 2026Updated 2 months ago
- Speaker embedding for anime speech domain based on ECAPA_TDNN☆18Jun 22, 2025Updated 9 months ago
- [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping☆92Nov 30, 2025Updated 4 months ago
- [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion☆299Jul 15, 2025Updated 9 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICLR 26] Part-X-MLLM: Part-aware 3D Multimodal Large Language Model☆114Jan 26, 2026Updated 2 months ago
- Fud AI is an open sourced and Free AI calorie tracker for iOS☆102Apr 1, 2026Updated 2 weeks ago
- Copilot with deepseek and more...☆13Mar 7, 2025Updated last year
- A gRPC client library for Firestore, intended to run on Cloud Run.☆13Mar 13, 2020Updated 6 years ago
- DreamStyle: A Unified Framework for Video Stylization☆117Jan 7, 2026Updated 3 months ago
- (Siggraph Asia 2025) Code of "LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization"☆26Dec 29, 2025Updated 3 months ago
- High-performance, semantic turn detection for conversational AI☆36Oct 1, 2025Updated 6 months ago
- A ComfyUI and ComfyScript Gradio-based app for generating characters using a multi-step process.☆19Nov 5, 2025Updated 5 months ago
- [Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning☆74Mar 26, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Proof of concept for running moshi/hibiki using webrtc☆20Feb 28, 2025Updated last year
- RedSage: A Cybersecurity Generalist LLM (ICLR'26)☆39Apr 7, 2026Updated last week
- Multi-Agent Team for Creating Long-form Videos☆146Mar 11, 2026Updated last month
- A collection python tools used to create gguf files and upload to huggingface☆17Mar 28, 2026Updated 2 weeks ago
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation☆115Dec 11, 2025Updated 4 months ago
- ☆12Oct 14, 2024Updated last year
- A high-precision RAG framework leveraging Baidu ERNIE and Milvus. Features hybrid search and reranking algorithms for accurate PDF parsin…☆59Dec 7, 2025Updated 4 months ago
- A template for new Blender addon projects. Evolves as I muddle my way through. They might not be best practices, but they're mine.☆10Jan 20, 2026Updated 2 months ago
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit☆138Apr 5, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CVPR 2026 Highlight] Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision☆104Updated this week
- ☆16Apr 3, 2025Updated last year
- Never forget the resource that helps to close that sales call! Power a real-time speech-to-text agent with retrieval augmented generation…☆14Jan 23, 2024Updated 2 years ago
- [ICLR 2026] Mobile-GS: Real-time Gaussian Splatting for Mobile Devices☆229Mar 30, 2026Updated 2 weeks ago
- Drax: Speech Recognition with Discrete Flow Matching☆75Oct 15, 2025Updated 6 months ago
- ☆15Nov 28, 2025Updated 4 months ago
- An isomorphic version of Node.js's require("node:util").inspect API☆34Jun 7, 2025Updated 10 months ago