OnTrack-UG-Squad / speaker-verification
A public repository of work for the Speech Verification component of the undergrad squad for Doubtfire.
☆13Updated 2 years ago
Related projects: ⓘ
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆12Updated last year
- ☆26Updated this week
- Article about deploying machine learning models using grpc, pytorch and asyncio☆24Updated last year
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆16Updated last year
- Web-based tool for straight-forward class annotation of audio files☆11Updated 4 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- Library for converting from RGB / GrayScale image to base64 and back.☆19Updated 2 years ago
- ParallelWaveGAN adaptation for Mozilla TTS☆15Updated 4 years ago
- Collection of models and extensions for deployment in PyTorch☆24Updated last year
- ☆26Updated last year
- Simple text to phonemes converter for multiple languages☆21Updated last year
- It is an algorithm analysed the acoustic features of a voice and creates an acoustic classifier - USEFUL for auto-speech-rater☆11Updated 5 years ago
- ☆74Updated 2 years ago
- ☆22Updated this week
- DEPRECATED version of SoundFile☆14Updated 4 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆25Updated 3 years ago
- LogMMSE speech enhancement/noise reduction☆30Updated 4 years ago
- The Seshat audio annotation management platform☆13Updated 3 years ago
- SpeechYOLO Interspeech 2019☆42Updated 2 years ago
- Zalo AI Challenge 2020 - Top 2 @ Voice Verification☆14Updated last year
- Zero-shot Audio Classification using Whisper☆74Updated last year
- CMPT726 Machine Learning Final Project☆11Updated 5 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆34Updated last year
- Keras implementations of Tacotron-2☆27Updated 3 years ago
- Speech in Flax/JAX☆15Updated 2 years ago
- A library to create and load tfrecord files as tf.data.Dataset☆9Updated 4 months ago
- Accompanying code for the paper: Totally Looks Like - How Humans Compare, Compared to Machines, by Amir Rosenfeld, Markus D. Solbach and …☆39Updated 5 years ago
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain☆10Updated last year
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 5 years ago
- Utilities for working with videos☆13Updated 2 years ago