deepaudio / deepaudio-speaker
neural network based speaker embedder
☆25Updated last year
Related projects ⓘ
Alternatives and complementary repositories for deepaudio-speaker
- End-to-end diarization loss☆22Updated 3 years ago
- ☆33Updated 2 years ago
- PyTorch implementation of Continuous Speech Separation☆13Updated 2 years ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 4 years ago
- ☆55Updated 3 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆19Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging☆41Updated 5 months ago
- video cut powered by AI☆25Updated 2 years ago
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆36Updated 6 months ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆25Updated 4 months ago
- ☆36Updated 2 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆47Updated 4 months ago
- Clustering-based methods for overlapping diarization☆70Updated 10 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 3 years ago
- ☆56Updated last year
- ☆41Updated last year
- Torch-based tool for quantizing high-dimensional vectors using additive codebooks☆50Updated 2 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated last month
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 3 years ago
- Speech samples and code of BEdit-TTS☆32Updated last year
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆74Updated last year
- ☆26Updated last year
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated last year
- Prosodic Speech Segmentation with Transformers☆23Updated 8 months ago
- A handy dataset of noises for ASR☆19Updated 5 years ago