TUM-Dev / gocast-voice-service
Microservice that generates subtitles for TUM-Live
☆17Updated 3 months ago
Alternatives and similar repositories for gocast-voice-service:
Users who are interested in gocast-voice-service are comparing it to the repositories listed below
- TUM's lecture streaming service.☆191Updated this week
- New backend written in Go with gRPC as an API interface☆16Updated this week
- Proxy for the TUM iCal export to remove clutter☆39Updated last week
- Speaker diarization service☆21Updated 2 weeks ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆11Updated 7 months ago
- proof of concept conversation orchestrator with a speech-language model☆19Updated 6 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated 3 weeks ago
- ☆10Updated this week
- A website providing links, redirects and tools related to the Technical University of Munich☆91Updated this week
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…☆18Updated 5 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated 2 years ago
- Sisyphus recipes for ASR☆16Updated this week
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆21Updated last month
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 3 months ago
- Swarah: Indian-English speech dataset collected across the country☆29Updated last year
- Navigating around TUM with excellence – A website and API to search for rooms, buildings and other places☆54Updated this week
- a simple system for 2-way interruptible voice interactions between human and LLM☆28Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Audio search using Azure Cognitive Search☆22Updated last year
- A lightweight Python library for running TTS models with a unified API.☆18Updated 2 months ago
- Joint speech-language model - respond directly to audio!☆30Updated 11 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆14Updated last month
- The Mensa's first useful queue-measurement software☆30Updated 2 years ago
- A curated list of awesome voice activity detection☆50Updated 5 months ago
- On-device noise suppression powered by deep learning☆69Updated last week
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Audio De-Noiser using a Convolutional Neural Network Architecture built with Tensorflow.js☆20Updated last year
- TUM Grundlagen Algorithmen und Datenstrukturen 2019 Extras (Tests & co.)☆5Updated 5 years ago
- Gladia SDK for JavaScript/TypeScript☆21Updated last year
- Plugin integrating Artemis programming exercises into IntelliJ☆31Updated last month