The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"
☆73Aug 15, 2025Updated 7 months ago
Alternatives and similar repositories for Spatial-Speech-Translation
Users that are interested in Spatial-Speech-Translation are comparing it to the libraries listed below.
- ☆31Jul 31, 2025Updated 8 months ago
- Core ML Demos is an experimental Core ML app. It visualizes the inference results of ML models and can be used to benchmark ML models and…☆12Jan 8, 2026Updated 3 months ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 6 months ago
- ☆14May 20, 2025Updated 10 months ago
- ☆15Jan 12, 2026Updated 2 months ago
- ☆64Jul 1, 2025Updated 9 months ago
- ☆15Apr 11, 2024Updated last year
- ☆22Aug 21, 2025Updated 7 months ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆57Aug 15, 2025Updated 7 months ago
- LINEBot☆13Apr 7, 2025Updated last year
- ☆21Jul 15, 2024Updated last year
- In this repository, we deal with developing different estimators to localize Transvahan - the e-vehicle on IISc Campus using measurements…☆19Jul 2, 2020Updated 5 years ago
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"☆133Nov 19, 2024Updated last year
- ☆166Nov 29, 2024Updated last year
- ☆605Oct 26, 2024Updated last year
- ☆20Jul 19, 2024Updated last year
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆384Jan 23, 2026Updated 2 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆29Jul 24, 2025Updated 8 months ago
- 100% private AI transcription with an intuitive template system for maximum flexibility☆74Jul 27, 2025Updated 8 months ago
- ☆17Jan 31, 2023Updated 3 years ago
- AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference☆21Jan 24, 2025Updated last year
- Langchain desktop app @multi-Agent☆30Jun 8, 2024Updated last year
- A unified robotic manipulation learning framework☆21Sep 4, 2025Updated 7 months ago
- ☆22Jul 25, 2023Updated 2 years ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆25Dec 14, 2025Updated 3 months ago
- [SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling☆47Sep 26, 2025Updated 6 months ago
- Sound Event Localization and Detection using Neural Generalized Cross-Correlations☆33Feb 11, 2025Updated last year
- This is the official repository of Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities☆40Mar 25, 2026Updated 2 weeks ago
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆76Jun 16, 2025Updated 9 months ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆14Apr 29, 2025Updated 11 months ago
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆26Jan 4, 2024Updated 2 years ago
- Big Impulse Response Dataset☆156Oct 19, 2022Updated 3 years ago
- StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.☆1,257Jun 29, 2025Updated 9 months ago
- [ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation☆24Mar 25, 2025Updated last year
- code for A Large-scale Dataset for Audio-Language Representation Learning☆14Sep 18, 2024Updated last year
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆39Jun 20, 2025Updated 9 months ago
- An open source code of the GitHub Copilot Workspace☆13Jun 8, 2024Updated last year
- Voice Activity Detector (VAD) : low-latency, high-performance and lightweight☆2,060Feb 2, 2026Updated 2 months ago
- ☆11Apr 5, 2023Updated 3 years ago