chentuochao/Spatial-Speech-Translation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chentuochao/Spatial-Speech-Translation)

chentuochao / Spatial-Speech-Translation

The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"

☆74

Alternatives and similar repositories for Spatial-Speech-Translation

Users that are interested in Spatial-Speech-Translation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YutongWen / GuideSep
View on GitHub
☆31Jul 31, 2025Updated 11 months ago
RoboCupAtHome / Montreal2018
View on GitHub
Data and data for the RoboCup world championship 2018 taking place in Montreal, Canada
☆10Dec 17, 2018Updated 7 years ago
Adaxry / Unified_Layer_Skipping
View on GitHub
☆15Apr 11, 2024Updated 2 years ago
OriMeAI / VisionY
View on GitHub
AI Makes Creativity Visible
☆15Aug 27, 2025Updated 10 months ago
Yip-Jia-Qi / codecformer
View on GitHub
☆21Jul 15, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jimmyliao / linebot
View on GitHub
LINEBot
☆13Apr 7, 2025Updated last year
vineeths96 / TDOA-Localization
View on GitHub
In this repository, we deal with developing different estimators to localize Transvahan - the e-vehicle on IISc Campus using measurements…
☆20Jul 2, 2020Updated 6 years ago
farewellthree / PPLLaVA
View on GitHub
Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"
☆133Nov 19, 2024Updated last year
nethermanpro / transvip
View on GitHub
☆164Nov 29, 2024Updated last year
axeber01 / wav2pos
View on GitHub
3D Sound Source Localization using Masked Autoencoders
☆21Feb 12, 2025Updated last year
chentuochao / Sound_Bubble
View on GitHub
Project for speech bubble
☆66Aug 15, 2025Updated 11 months ago
MYZY-AI / Muyan-TTS
View on GitHub
☆480May 19, 2025Updated last year
camenduru / Matting-Anything-colab
View on GitHub
☆10Jul 25, 2023Updated 2 years ago
PsychArch / minimax-mcp-tools
View on GitHub
Async MCP server with Minimax API integration for image generation and text-to-speech
☆50Jan 29, 2026Updated 5 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
iLearn-Lab / ACL25-PTQ1.61
View on GitHub
☆15Apr 6, 2026Updated 3 months ago
lihuithe / podlm-public
View on GitHub
☆608Oct 26, 2024Updated last year
Deep-unlearning / Finetune-Dia-TTS
View on GitHub
☆22Aug 21, 2025Updated 11 months ago
HumanMLLM / CoGenAV
View on GitHub
☆64Jul 1, 2025Updated last year
etrobot / next-langchain-tauri
View on GitHub
Langchain desktop app @multi-Agent
☆30Jun 8, 2024Updated 2 years ago
neuroailab / SpelkeNet
View on GitHub
☆15Jul 23, 2025Updated 11 months ago
pixeli99 / MixLN
View on GitHub
[ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆30Jul 24, 2025Updated 11 months ago
camenduru / ECON-colab
View on GitHub
☆21Jul 25, 2023Updated 2 years ago
Haschtl / transcripy
View on GitHub
Multi speaker audio transcription
☆46Nov 25, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
FrancoisGrondin / BIRD
View on GitHub
Big Impulse Response Dataset
☆159Oct 19, 2022Updated 3 years ago
trustmlyoungscientist / EDPA_attack_defense
View on GitHub
Model-agnostic Adversarial Attack and Defense for Vision-Language-Action Models
☆18Dec 12, 2025Updated 7 months ago
thunlp / Migician
View on GitHub
[ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models
☆90May 20, 2025Updated last year
axeber01 / ngcc
View on GitHub
Neural Generalized Cross Correlations https://arxiv.org/abs/2208.04654
☆37Feb 11, 2025Updated last year
LoieSun / Auto-ACD
View on GitHub
code for A Large-scale Dataset for Audio-Language Representation Learning
☆14Sep 18, 2024Updated last year
ictnlp / StreamSpeech
View on GitHub
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
☆1,277Jun 29, 2025Updated last year
bushkarl / videoprocessor
View on GitHub
智能视频处理系统
☆46Dec 26, 2024Updated last year
ruvnet / open-space
View on GitHub
An open source code of the GitHub Copilot Workspace
☆13Jun 8, 2024Updated 2 years ago
mshojaei77 / ReActMCP
View on GitHub
ReActMCP is a reactive MCP client that empowers AI assistants to instantly respond with real-time, Markdown-formatted web search insights…
☆142Mar 19, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
secondpathstudio / privatescribe
View on GitHub
100% private AI transcription using local LLMs, fully encrypted database, and extensive administrative tools
☆90Jun 30, 2026Updated 3 weeks ago
WikiChao / ZeroSep
View on GitHub
[NeurIPS 2025] Separate Anything in Audio with Zero Training
☆60Nov 3, 2025Updated 8 months ago
chentuochao / LlamaPIE
View on GitHub
Github repo for paper: LlamaPIE: Proactive In-Ear Conversation Assistants
☆21Apr 17, 2026Updated 3 months ago
realtime-ai / realtime-ai
View on GitHub
A real-time Agent framework for audio and video.
☆182Feb 17, 2026Updated 5 months ago
DMS3tv / fastgraph
View on GitHub
Fastgraph is a tool for streamlined bulk headphone measurements. ASIO support is experimental.
☆29Updated this week
DensoITLab / bitprune
View on GitHub
☆11Apr 5, 2023Updated 3 years ago
GATECH-EIC / Linearized-LLM
View on GitHub
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
☆35Jun 12, 2024Updated 2 years ago