PranavPutsa1006 / Speaker-DiarizationLinks
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
☆18Updated last year
Alternatives and similar repositories for Speaker-Diarization
Users that are interested in Speaker-Diarization are comparing it to the libraries listed below
Sorting:
- Speaker diarization service☆23Updated last month
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆13Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆51Updated last week
- ☆15Updated 2 months ago
- asr2k☆50Updated last year
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated last year
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated 8 months ago
- Real-time Speech Separation, Noise Suppression & Speaker Recognition☆18Updated 6 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆100Updated 3 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- Tunable pipelines☆34Updated 3 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆83Updated last year
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- Putting flows on top of neural transducers for better TTS☆62Updated last week
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- Code for AccentDB.☆22Updated 4 years ago
- Simple PyTorch Denoisers for Waveform Audio☆35Updated last month
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- Keras(Tensorflow) implementations of Automatic Speech Recognition☆23Updated 3 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- Deep Speech Distances PyTorch☆28Updated 3 years ago
- ☆56Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆27Updated 4 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆51Updated 2 years ago
- ☆76Updated 3 years ago