chentuochao / Spatial-Speech-TranslationLinks
The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"
☆61Updated 3 weeks ago
Alternatives and similar repositories for Spatial-Speech-Translation
Users that are interested in Spatial-Speech-Translation are comparing it to the libraries listed below
Sorting:
- ☆160Updated 6 months ago
- ☆77Updated last month
- openai realtime webrtc python client☆42Updated 5 months ago
- The official GitHub Page for MiniMax☆35Updated last week
- A lightweight end-to-end text-to-speech model☆115Updated 3 months ago
- A toolkit for speaker diarization.☆195Updated 2 weeks ago
- ComfyUI wrapper for Moondream's gaze detection☆53Updated 4 months ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆44Updated 4 months ago
- openai realtime webrtc demo☆22Updated 4 months ago
- Real time faster whisper gradio☆26Updated 7 months ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆61Updated 3 weeks ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆34Updated 2 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 3 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆218Updated last month
- Kyutai with an "eye"☆197Updated 2 months ago
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆121Updated last year
- Jina DeepSearch UI☆110Updated 3 weeks ago
- Model Context Protocol (MCP) server implementation with Minimax API integration☆48Updated last month
- ☆88Updated 2 months ago
- Turn any Hugging Face Space or Gradio application into a discord.js bot.☆11Updated this week
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆111Updated 6 months ago
- 如何得到最好的结果,Improve-Your-Prompt是一个用于优化prompt的prompt☆41Updated 6 months ago
- ☆156Updated 7 months ago
- Demo app for Groq plugins in LiveKit Agents☆50Updated 2 months ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆80Updated 9 months ago
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and …☆14Updated last month
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper.☆108Updated 8 months ago
- PodAgent: A Comprehensive Framework for Podcast Generation☆87Updated 2 weeks ago
- An AI agent to control drones☆109Updated this week
- Generate Web Pages and Components with text prompts, with Local Models. (or Cloud Models, if you want) - now supports Thinking Models!☆155Updated 3 weeks ago