chentuochao / Spatial-Speech-TranslationLinks
The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"
☆64Updated 2 months ago
Alternatives and similar repositories for Spatial-Speech-Translation
Users that are interested in Spatial-Speech-Translation are comparing it to the libraries listed below
Sorting:
- ☆165Updated 7 months ago
- Kyutai with an "eye"☆207Updated 3 months ago
- The official GitHub Page for MiniMax☆47Updated last week
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆73Updated this week
- ☆500Updated 3 weeks ago
- Model Context Protocol (MCP) server implementation with Minimax API integration☆49Updated 3 months ago
- ☆77Updated 3 months ago
- Googles NotebookLM but local☆315Updated 2 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆121Updated 9 months ago
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024)☆220Updated this week
- openai realtime webrtc python client☆42Updated 6 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆261Updated last month
- Real time faster whisper gradio☆26Updated 9 months ago
- an open source ai stylist☆64Updated last week
- A lightweight end-to-end text-to-speech model☆115Updated 4 months ago
- ☆19Updated last week
- openai realtime webrtc demo☆22Updated 6 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆112Updated 8 months ago
- Full list of LLM API with Internet Access☆73Updated 5 months ago
- ☆14Updated 7 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 5 months ago
- Jina DeepSearch UI☆117Updated 2 weeks ago
- Have a natural voice conversation with an LLM☆250Updated 7 months ago
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆77Updated last year
- Auto Thinking Mode switch for Qwen3 in Open webui☆66Updated 2 months ago
- An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social graphs.☆77Updated 2 weeks ago
- ☆171Updated 10 months ago
- 如何得到最好的结果,Improve-Your-Prompt是一个用于优化prompt的prompt☆41Updated 7 months ago
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and …☆14Updated 2 months ago
- coze api to openai☆14Updated 10 months ago