chentuochao / Spatial-Speech-TranslationLinks
The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"
☆69Updated last month
Alternatives and similar repositories for Spatial-Speech-Translation
Users that are interested in Spatial-Speech-Translation are comparing it to the libraries listed below
Sorting:
- ☆166Updated 10 months ago
- ☆522Updated last week
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆81Updated 2 months ago
- ☆77Updated 5 months ago
- The official GitHub Page for MiniMax☆55Updated 3 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆113Updated 11 months ago
- an open source ai stylist☆74Updated 3 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆129Updated last month
- Have a natural voice conversation with an LLM☆255Updated last week
- Kyutai with an "eye"☆221Updated 6 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆173Updated last week
- ☆258Updated last month
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆285Updated 4 months ago
- A lightweight end-to-end text-to-speech model☆120Updated 7 months ago
- Turn any Hugging Face Space or Gradio application into a discord.js bot.☆12Updated last week
- ☆464Updated 4 months ago
- ☆121Updated this week
- Async MCP server with Minimax API integration for image generation and text-to-speech☆51Updated 2 weeks ago
- coze api to openai☆15Updated last year
- ☆42Updated last month
- ☆105Updated 2 weeks ago
- Jina DeepSearch UI☆126Updated last month
- AudioStory: Generating Long-Form Narrative Audio with Large Language Models☆281Updated 2 weeks ago
- openai realtime webrtc python client☆45Updated 9 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 8 months ago
- ☆170Updated last year
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS.☆282Updated 9 months ago
- Real time faster whisper gradio☆26Updated last month
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆128Updated last year
- ☆20Updated 10 months ago