chentuochao / Spatial-Speech-TranslationLinks
The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"
☆65Updated 3 months ago
Alternatives and similar repositories for Spatial-Speech-Translation
Users that are interested in Spatial-Speech-Translation are comparing it to the libraries listed below
Sorting:
- ☆166Updated 8 months ago
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆78Updated 3 weeks ago
- ☆77Updated 3 months ago
- ☆512Updated last month
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆112Updated 9 months ago
- Kyutai with an "eye"☆212Updated 4 months ago
- The official GitHub Page for MiniMax☆49Updated last month
- Have a natural voice conversation with an LLM☆252Updated 8 months ago
- openai realtime webrtc python client☆45Updated 7 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆271Updated 2 months ago
- Googles NotebookLM but local☆342Updated 3 months ago
- Async MCP server with Minimax API integration for image generation and text-to-speech☆49Updated last week
- an open source ai stylist☆67Updated last month
- Jina DeepSearch UI☆120Updated last month
- Datalore is an AI-powered Data Analysis tool that integrates Anthropic's Claude API with various data analysis libraries and custom funct…☆40Updated 5 months ago
- A lightweight end-to-end text-to-speech model☆117Updated 5 months ago
- ☆14Updated 8 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆123Updated 10 months ago
- 如何得到最好的结果,Improve-Your-Prompt是一个用于优化prompt的prompt☆41Updated 8 months ago
- Turn local files into a prompt for an LLM☆175Updated 6 months ago
- coze api to openai☆14Updated 11 months ago
- Turn any Hugging Face Space or Gradio application into a discord.js bot.☆12Updated last week
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 5 months ago
- ☆11Updated 11 months ago
- ☆19Updated 8 months ago
- ☆91Updated last month
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆216Updated 3 months ago
- The showcase page of IndexTTS2☆106Updated last month
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024)☆262Updated last week
- An agentic workflow for story book generation☆30Updated 4 months ago