chentuochao / Spatial-Speech-Translation
The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"
☆71 · Updated 4 months ago
Alternatives and similar repositories for Spatial-Speech-Translation
Users who are interested in Spatial-Speech-Translation compare it to the repositories listed below.
- The official GitHub Page for MiniMax ☆60 · Updated last month
- ☆167 · Updated last year
- Extract any sound with text prompts. Memory-optimized SAM-Audio with modern UI. ☆140 · Updated last week
- ☆532 · Updated 3 months ago
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast ☆84 · Updated 5 months ago
- Kyutai with an "eye" ☆232 · Updated 9 months ago
- ☆80 · Updated 8 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI ☆309 · Updated 3 months ago
- A lightweight end-to-end text-to-speech model ☆125 · Updated 10 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper. ☆141 · Updated 3 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task… ☆33 · Updated 10 months ago
- ☆94 · Updated 5 months ago
- an open source ai stylist ☆77 · Updated 6 months ago
- ☆108 · Updated 3 weeks ago
- openai realtime webrtc python client ☆47 · Updated last year
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆306 · Updated 7 months ago
- Have a natural voice conversation with an LLM ☆260 · Updated 2 months ago
- ☆45 · Updated 4 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps ☆115 · Updated last year
- ☆122 · Updated this week
- ☆243 · Updated last week
- Mission intent compiler and autonomy supervisor for unmanned systems. ☆144 · Updated 2 weeks ago
- How to get the best results: Improve-Your-Prompt is a prompt for optimizing prompts ☆37 · Updated last year
- Full list of LLM API with Internet Access ☆80 · Updated 4 months ago
- Real time faster whisper gradio ☆25 · Updated 4 months ago
- Async MCP server with Minimax API integration for image generation and text-to-speech ☆51 · Updated 2 months ago
- ☆333 · Updated 4 months ago
- AudioStory: Generating Long-Form Narrative Audio with Large Language Models ☆292 · Updated 3 months ago
- ☆170 · Updated last year
- ☆174 · Updated 4 months ago