guilinhu / proactive_hearing_assistantView external linksLinks
Code for the paper Proactive Hearing Assistants that Isolate Egocentric Conversations
☆43Nov 19, 2025Updated 2 months ago
Alternatives and similar repositories for proactive_hearing_assistant
Users that are interested in proactive_hearing_assistant are comparing it to the libraries listed below
Sorting:
- End2End Virtual Try-on with Visual Reference☆57Nov 19, 2025Updated 2 months ago
- An example of ADK Agent for Long-form video generation with Veo 3.1 and Gemini☆86Nov 13, 2025Updated 3 months ago
- FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆24Jan 13, 2026Updated last month
- [ICLR 26] Part-X-MLLM: Part-aware 3D Multimodal Large Language Model☆110Jan 26, 2026Updated 2 weeks ago
- ☆14Sep 2, 2025Updated 5 months ago
- A collection python tools used to create gguf files and upload to huggingface☆17Updated this week
- Copilot with deepseek and more...☆12Mar 7, 2025Updated 11 months ago
- DreamStyle: A Unified Framework for Video Stylization☆110Jan 7, 2026Updated last month
- (Siggraph Asia 2025) Code of "LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization"☆26Dec 29, 2025Updated last month
- A Cybersecurity Generalist LLM (ICLR'26)☆27Updated this week
- Resilient multi-LLM orchestration with in-built failure handing, rate limits, retries, and circuit breaker.☆29Feb 4, 2026Updated last week
- 👋 Dataset and Benchmark code for EgoEdit☆106Dec 11, 2025Updated 2 months ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- Linly-Talker-Stream: Real-Time Streaming Conversational Digital Human System —— Full-duplex, low-latency, real-time interactive digital h…☆23Updated this week
- ☆15Nov 28, 2025Updated 2 months ago
- ☆13Updated this week
- ☆11May 9, 2023Updated 2 years ago
- [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping☆90Nov 30, 2025Updated 2 months ago
- ☆13Jul 2, 2025Updated 7 months ago
- High-performance, semantic turn detection for conversational AI☆34Oct 1, 2025Updated 4 months ago
- ☆17Jun 7, 2022Updated 3 years ago
- ☆31Jan 24, 2026Updated 3 weeks ago
- Firmware Analysis Tool☆16Oct 24, 2025Updated 3 months ago
- This repository presents a comprehensive solution for teeth segmentation on dental X-ray images using the powerful Detectron2 framework. …☆16Nov 10, 2024Updated last year
- Speaker embedding for anime speech domain based on ECAPA_TDNN☆16Jun 22, 2025Updated 7 months ago
- Overworld's local world client interface to run Waypoint world models☆44Updated this week
- LLM Agents Demo using Google ADK☆16Dec 23, 2025Updated last month
- ☆47Dec 8, 2025Updated 2 months ago
- Drax: Speech Recognition with Discrete Flow Matching☆75Oct 15, 2025Updated 3 months ago
- A free Twilio app to let Boston residents call their families while phone coverage is poor.☆90Mar 12, 2016Updated 9 years ago
- This is the official implementation of work HiM2SAM in PRCV25.☆25Aug 30, 2025Updated 5 months ago
- A gRPC client library for Firestore, intended to run on Cloud Run.☆13Mar 13, 2020Updated 5 years ago
- [ICLR 2025] Official implementation of Articulate-Anything☆171Jul 8, 2025Updated 7 months ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆14Nov 20, 2025Updated 2 months ago
- High-performance TLS/DTLS/QUIC library in 100% safe Rust☆77Nov 19, 2025Updated 2 months ago
- Enhanced Rayhunter Fork - 3x Cellular Data Coverage for IMSI Catcher Detection☆34Jul 20, 2025Updated 6 months ago
- リアク トフローで作るチャット☆16Feb 17, 2025Updated 11 months ago
- ☆16Feb 24, 2025Updated 11 months ago
- ☆15Apr 3, 2025Updated 10 months ago