unoti / voice-embeddingsLinks
Audio processing using deep neural networks. Speaker identification using voice embeddings.
☆13Updated 2 years ago
Alternatives and similar repositories for voice-embeddings
Users that are interested in voice-embeddings are comparing it to the libraries listed below
Sorting:
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain☆11Updated last year
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated 2 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Updated 2 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆13Updated 2 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 5 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆36Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Dataset Release for Intent Classification from Speech☆47Updated 4 months ago
- A 🔥 cookiecutter template for building Hugging Face Spaces☆11Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- Official PyTorch implementation of TTS Style Transfer☆24Updated 3 years ago
- Text to Speech for Indic languages☆51Updated 3 years ago
- ☆13Updated 3 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆29Updated 3 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆33Updated last year
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 4 months ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆27Updated 3 years ago
- Speech in Flax/JAX☆15Updated 3 years ago
- App to explore latent spaces of music collections☆34Updated last year
- Easily turn large sets of audio urls to an audio dataset.☆21Updated 2 years ago
- A Pytorch Implementations for Various Vector Quantization Methods☆30Updated 3 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆30Updated last year
- ☆12Updated 3 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated 2 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 6 years ago
- ☆16Updated 4 months ago
- ☆23Updated 2 years ago