chentuochao / Spatial-Speech-Translation
The official repo for the paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"
☆69 Updated 3 months ago
Alternatives and similar repositories for Spatial-Speech-Translation
Users who are interested in Spatial-Speech-Translation are comparing it to the repositories listed below
- ☆166 Updated 11 months ago
- The official GitHub Page for MiniMax ☆60 Updated last week
- ☆527 Updated last month
- ☆80 Updated 7 months ago
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast ☆82 Updated 4 months ago
- Kyutai with an "eye" ☆223 Updated 7 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps ☆114 Updated last year
- Async MCP server with Minimax API integration for image generation and text-to-speech ☆51 Updated 3 weeks ago
- Have a natural voice conversation with an LLM ☆258 Updated last month
- Liquid Audio - Speech-to-Speech audio models by Liquid AI ☆250 Updated last month
- an open source ai stylist ☆76 Updated 4 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper. ☆136 Updated 2 months ago
- ☆312 Updated 2 months ago
- ☆57 Updated last year
- openai realtime webrtc python client ☆46 Updated 10 months ago
- ☆11 Updated last year
- How to get the best results: Improve-Your-Prompt is a prompt for optimizing prompts ☆39 Updated 11 months ago
- Real time faster whisper gradio ☆26 Updated 3 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task… ☆34 Updated 9 months ago
- Turn any Hugging Face Space or Gradio application into a discord.js bot. ☆12 Updated this week
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch… ☆79 Updated last year
- ☆170 Updated last year
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆296 Updated 5 months ago
- Jina DeepSearch UI ☆126 Updated 2 months ago
- A lightweight end-to-end text-to-speech model ☆123 Updated 8 months ago
- Turn local files into a prompt for an LLM ☆177 Updated 10 months ago
- An opensource implementation of NotebookLM using Deepseek-V3 and PlayHT TTS. ☆289 Updated 10 months ago
- Realtime Audio SDK for the Web: audio capture, echo cancellation (AEC), voice activity detection (VAD), and real-time encoding (Opus/PCM… ☆116 Updated last month
- Full list of LLM API with Internet Access ☆79 Updated 3 months ago
- coze api to openai ☆15 Updated last year