chentuochao / Spatial-Speech-Translation
The official repo for the paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"
☆64 Updated last month
Alternatives and similar repositories for Spatial-Speech-Translation
Users who are interested in Spatial-Speech-Translation are comparing it to the libraries listed below.
- ☆160 Updated 6 months ago
- The official GitHub Page for MiniMax ☆41 Updated 3 weeks ago
- A lightweight end-to-end text-to-speech model ☆114 Updated 4 months ago
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast ☆70 Updated last week
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation. ☆123 Updated last year
- ComfyUI wrapper for Moondream's gaze detection ☆53 Updated 4 months ago
- openai realtime webrtc demo ☆22 Updated 5 months ago
- Full list of LLM API with Internet Access ☆73 Updated 4 months ago
- Turn any Hugging Face Space or Gradio application into a discord.js bot. ☆11 Updated last week
- ☆76 Updated 2 months ago
- Kyutai with an "eye" ☆200 Updated 2 months ago
- A diffusers pipeline for zero-shot stylised couples portrait creation ☆101 Updated 6 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆258 Updated 3 weeks ago
- A toolkit for speaker diarization. ☆203 Updated 2 weeks ago
- ☆16 Updated last year
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and … ☆14 Updated 2 months ago
- Jina DeepSearch UI ☆114 Updated this week
- ☆480 Updated last week
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch… ☆80 Updated 9 months ago
- ☆14 Updated 7 months ago
- An agentic workflow for story book generation ☆30 Updated 3 months ago
- ☆90 Updated 3 months ago
- Gradio-powered application that converts audio recordings of meetings into transcripts and provides concise summaries using whisper. ☆114 Updated 8 months ago
- Google's NotebookLM but local ☆291 Updated 2 months ago
- openai realtime webrtc python client ☆42 Updated 5 months ago
- Model Context Protocol (MCP) server implementation with Minimax API integration ☆49 Updated 2 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps ☆111 Updated 7 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI! ☆35 Updated 2 months ago
- ☆96 Updated last week
- We Speech Transcript based on LLM, in 300 lines of code. ☆164 Updated this week