revdotcom / revai-python-sdk
Rev AI Python SDK
☆35 · Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for revai-python-sdk
- Rev.ai Java SDK ☆17 · Updated 3 weeks ago
- Node.js SDK for the Rev AI API ☆22 · Updated last week
- Rev.ai golang client ☆21 · Updated 8 months ago
- An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences. ☆158 · Updated 3 weeks ago
- Variational Bayes HMM over x-vectors diarization ☆254 · Updated 10 months ago
- Some simple wrappers around kaldi-asr intended to make using Kaldi's (online) decoders as convenient as possible. ☆170 · Updated 3 years ago
- Tutorial on Kaldi for Brandeis ASR course ☆76 · Updated 4 years ago
- ASR with PyTorch ☆140 · Updated 5 years ago
- Discriminative Neural Clustering for Speaker Diarisation ☆78 · Updated 2 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data ☆95 · Updated last year
- Small language toolkit for creation, interpolation and pruning of ARPA language models ☆91 · Updated 2 years ago
- DeepSpeech based forced alignment tool ☆235 · Updated 3 years ago
- Python speaker diarization system based on binary key speaker modelling ☆61 · Updated 2 years ago
- Diarization scoring tools. ☆220 · Updated last year
- Python server for communicating with Kaldi from the browser using WebRTC ☆67 · Updated last year
- Various speech datasets made available to the public ☆99 · Updated 2 months ago
- Forced Alignments for Common Voice ☆31 · Updated 4 years ago
- Convert words to numbers ☆20 · Updated 2 years ago
- Moved to https://github.com/k2-fsa/icefall ☆144 · Updated 2 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆73 · Updated 4 years ago
- A toolkit for reproducible evaluation, diagnosis, and error analysis of speaker diarization systems ☆188 · Updated last year
- A Python toolbox for speech feature extraction ☆159 · Updated last year
- A Python package that lets TensorFlow read Kaldi scp/ark files in an elegant way, so that Kaldi users can move comfortably into the TensorFlow world. ☆40 · Updated 5 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆62 · Updated 4 years ago
- Speech Enhancement using Bayesian WaveNet ☆96 · Updated 6 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databases ☆84 · Updated last month
- Server framework for Kaldi ASR Toolkit ☆97 · Updated last year
- Advanced data structures for handling temporal segments with attached labels. ☆99 · Updated 5 months ago
- Repository for our Interspeech 2020 general-purpose voice activity detection (GPVAD) paper ☆142 · Updated last year
- A list of publicly available audio data that anyone can download for ASR or other speech activities ☆200 · Updated 3 years ago