The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"
☆74Aug 15, 2025Updated 10 months ago
Alternatives and similar repositories for Spatial-Speech-Translation
Users that are interested in Spatial-Speech-Translation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Core ML Demos is an experimental Core ML app. It visualizes the inference results of ML models and can be used to benchmark ML models and…☆12Jan 8, 2026Updated 5 months ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 9 months ago
- ☆14May 20, 2025Updated last year
- Microphone Array Real-time System☆13Jun 7, 2017Updated 9 years ago
- ☆65Jul 1, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆15Apr 11, 2024Updated 2 years ago
- ☆22Aug 21, 2025Updated 10 months ago
- Run DeepSeek R1 model on an Ubuntu single board computer without user registration.☆14Jun 8, 2026Updated 3 weeks ago
- LINEBot☆13Apr 7, 2025Updated last year
- ☆21Jul 15, 2024Updated last year
- Official implementation of the paper "MusicInfuser: Making Video Diffusion Listen and Dance" (CVPR`26)☆85May 3, 2026Updated last month
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"☆133Nov 19, 2024Updated last year
- A Deep Q Reinforcement Learning Demo☆17May 2, 2026Updated last month
- Async MCP server with Minimax API integration for image generation and text-to-speech☆50Jan 29, 2026Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆609Oct 26, 2024Updated last year
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆390Jan 23, 2026Updated 5 months ago
- [ICCV 2025] DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness☆187Feb 11, 2026Updated 4 months ago
- ☆17Jan 31, 2023Updated 3 years ago
- Langchain desktop app @multi-Agent☆30Jun 8, 2024Updated 2 years ago
- ☆109Apr 4, 2026Updated 2 months ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆21Nov 3, 2025Updated 7 months ago
- A unified robotic manipulation learning framework☆23Sep 4, 2025Updated 9 months ago
- ☆21Jul 25, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆26Apr 26, 2026Updated 2 months ago
- [SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling☆49Apr 17, 2026Updated 2 months ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆14Apr 29, 2025Updated last year
- Playground that demonstrates advanced uses of Swift's Codable☆19Sep 23, 2018Updated 7 years ago
- Big Impulse Response Dataset☆159Oct 19, 2022Updated 3 years ago
- [ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models☆89May 20, 2025Updated last year
- StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.☆1,274Jun 29, 2025Updated last year
- [ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation☆24Mar 25, 2025Updated last year
- code for A Large-scale Dataset for Audio-Language Representation Learning☆14Sep 18, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 已迁移到👇这个仓库☆47Aug 29, 2024Updated last year
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆40May 4, 2026Updated last month
- Voice Activity Detector (VAD) : low-latency, high-performance and lightweight☆2,173Feb 2, 2026Updated 4 months ago
- ☆11Apr 5, 2023Updated 3 years ago
- 借助cloudflare tunnel实现在容器平台的frp内网穿透☆50Apr 17, 2025Updated last year
- Neural Generalized Cross Correlations https://arxiv.org/abs/2208.04654☆37Feb 11, 2025Updated last year
- ☆20Dec 19, 2023Updated 2 years ago