lhotse-speech / lhotse
Tools for handling multimodal data in machine learning projects.
☆1,026 · Updated last week
Alternatives and similar repositories for lhotse
Users who are interested in lhotse are comparing it to the libraries listed below.
- Large, modern dataset for speech recognition ☆677 · Updated last year
- FSA/FST algorithms, differentiable, with PyTorch compatibility. ☆1,208 · Updated 2 weeks ago
- ☆1,121 · Updated 3 weeks ago
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit ☆938 · Updated last month
- UniSpeech - Large Scale Self-Supervised Learning for Speech ☆463 · Updated last year
- A fast and lightweight python-based CTC beam search decoder for speech recognition. ☆446 · Updated last year
- List of speech synthesis papers. ☆1,045 · Updated last year
- An Open Source Tools for Speaker Recognition ☆618 · Updated 10 months ago
- End-to-End Neural Diarization ☆402 · Updated 3 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR ☆973 · Updated last year
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at… ☆421 · Updated 2 months ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation ☆541 · Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆337 · Updated last year
- g2p: English Grapheme To Phoneme Conversion ☆858 · Updated 2 years ago
- A library for speech data augmentation in time-domain ☆664 · Updated 3 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,336 · Updated last year
- Variational Bayes HMM over x-vectors diarization ☆269 · Updated last year
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems ☆213 · Updated 4 months ago
- Evaluate your speech-to-text system with similarity measures such as word error rate (WER) ☆742 · Updated 4 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆802 · Updated 6 months ago
- A CRF-based ASR Toolkit ☆334 · Updated this week
- Diarization scoring tools. ☆246 · Updated 2 years ago
- Towards hot directions in industrial end to end speech recognition ☆326 · Updated 3 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning. ☆1,052 · Updated 5 months ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search ☆691 · Updated 2 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch ☆1,607 · Updated last year
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab. ☆587 · Updated last year
- NeMo text processing for ASR and TTS ☆340 · Updated last week
- End-to-end ASR/LM implementation with PyTorch ☆596 · Updated 3 years ago
- A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf ☆369 · Updated 3 years ago