SELMA-project / ml4audio
audio, NLP, ML with huggingface, nvidia/nemo, speechbrain
☆11 Updated 2 years ago
Alternatives and similar repositories for ml4audio
Users who are interested in ml4audio are comparing it to the libraries listed below
- Code for OpenAI Whisper Web App Demo ☆93 Updated 3 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python ☆18 Updated 2 years ago
- A testing repo to share code and thoughts on diarisation ☆57 Updated last year
- ☆20 Updated 11 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆137 Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆70 Updated 3 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆121 Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆26 Updated 2 years ago
- ☆157 Updated 2 years ago
- Coqui AI TTS plugin ☆85 Updated 7 months ago
- 🐍 Coqui's machine learning job scheduler ☆31 Updated 4 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆135 Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 Updated last year
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆28 Updated 2 years ago
- Accelerate Whisper tasks such as transcription by multiprocessing through parallelization ☆25 Updated 3 years ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings. ☆13 Updated 3 years ago
- 🐸TTS recipes for different datasets ☆86 Updated 3 years ago
- whisper.cpp bindings for python ☆110 Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆100 Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction ☆106 Updated 7 months ago
- All-in-one Speech Transcription ☆10 Updated 2 weeks ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles ☆80 Updated 2 years ago
- OCTRA is a web-application for the orthographic transcription of audio files. ☆39 Updated 2 weeks ago
- Cog wrapper for collabora/WhisperSpeech ☆25 Updated last year
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆34 Updated 5 years ago
- Zero-shot Audio Classification using Whisper ☆79 Updated 3 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker … ☆20 Updated last year
- A free & open tool for transcribing audio interviews with offline ASR support ☆25 Updated 2 years ago
- Dockerfile and web server for running GPT-J-6B on AWS GPU instances ☆18 Updated 4 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice ☆10 Updated 5 years ago