Code for the paper Proactive Hearing Assistants that Isolate Egocentric Conversations
☆45Nov 19, 2025Updated 6 months ago
Alternatives and similar repositories for proactive_hearing_assistant
Users that are interested in proactive_hearing_assistant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- UniMesh: Unifying 3D Mesh Understanding and Generation☆57May 8, 2026Updated last month
- End2End Virtual Try-on with Visual Reference, CVPR2026☆68Apr 18, 2026Updated last month
- Echo-TTS inference codebase☆194Dec 5, 2025Updated 6 months ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 7 months ago
- Work in progress rust bindings to ggml☆11May 1, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is a high performance stub server.☆14Sep 3, 2024Updated last year
- A prototype app that controls an iPhone X with face gestures, using Apple's ARKit☆14May 6, 2019Updated 7 years ago
- NPU powered On-device AI Mobile applications using Melange☆60Mar 18, 2026Updated 2 months ago
- Text-to-text alignment algorithm for speech recognition error analysis.☆30Apr 6, 2026Updated 2 months ago
- Sample plugins for Deepstream for Tesla & Jetson☆11Jul 17, 2018Updated 7 years ago
- [CVPR 2026] When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models☆68Apr 11, 2026Updated 2 months ago
- [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion☆303Jul 15, 2025Updated 11 months ago
- 『入門 GUI(以下、GUI 本)』の第 2 章のサンプルコードです(同人版よりも内容的には進んでいますのでご了承ください)☆13Mar 4, 2023Updated 3 years ago
- [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping☆95Nov 30, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Package implements a number local outlier factor algorithms for outlier detection and finding anomalous data☆12Jun 7, 2017Updated 9 years ago
- MeetNote2 - Zoom Auto-Recording & Transcription App☆14Jan 8, 2025Updated last year
- [ICLR 26] Part-X-MLLM: Part-aware 3D Multimodal Large Language Model☆118Jan 26, 2026Updated 4 months ago
- ☆22Jan 19, 2026Updated 4 months ago
- Copilot with deepseek and more...☆13Mar 7, 2025Updated last year
- A gRPC client library for Firestore, intended to run on Cloud Run.☆13Mar 13, 2020Updated 6 years ago
- Cheat sheet for using curl to work with Cloud Pub/Sub☆13Mar 10, 2022Updated 4 years ago
- DreamStyle: A Unified Framework for Video Stylization☆119Jan 7, 2026Updated 5 months ago
- A ComfyUI and ComfyScript Gradio-based app for generating characters using a multi-step process.☆19Nov 5, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Proof of concept for running moshi/hibiki using webrtc☆21Feb 28, 2025Updated last year
- Code repo for EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation☆42Mar 6, 2026Updated 3 months ago
- Provides darwin (Mac) binary support on Linux when cross-compiling Go apps that have CGO dependencies☆16Nov 5, 2024Updated last year
- A collection python tools used to create gguf files and upload to huggingface☆17Jun 6, 2026Updated last week
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation☆118Dec 11, 2025Updated 6 months ago
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆66Apr 28, 2026Updated last month
- A high-precision RAG framework leveraging Baidu ERNIE and Milvus. Features hybrid search and reranking algorithms for accurate PDF parsin…☆62Dec 7, 2025Updated 6 months ago
- ☆22Sep 2, 2025Updated 9 months ago
- A template for new Blender addon projects. Evolves as I muddle my way through. They might not be best practices, but they're mine.☆10Jan 20, 2026Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICLR 2025] Official implementation of Articulate-Anything☆193Jul 8, 2025Updated 11 months ago
- ☆16Apr 3, 2025Updated last year
- ☆11May 9, 2023Updated 3 years ago
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit☆150Apr 5, 2026Updated 2 months ago
- Never forget the resource that helps to close that sales call! Power a real-time speech-to-text agent with retrieval augmented generation…☆14Jan 23, 2024Updated 2 years ago
- A text-grid web renderer for AI agents — see the web without screenshots☆98Mar 10, 2026Updated 3 months ago
- RedSage: A Cybersecurity Generalist LLM (ICLR'26)☆49May 12, 2026Updated last month