ccoreilly / wav2vec2-service
☆38Updated 3 years ago
Alternatives and similar repositories for wav2vec2-service
Users that are interested in wav2vec2-service are comparing it to the libraries listed below
Sorting:
- ☆36Updated 2 weeks ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆100Updated 3 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆112Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆50Updated 10 months ago
- ☆46Updated 2 years ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- A merged version of multiple open-source German speech datasets.☆31Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆138Updated 4 months ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆21Updated last year
- The VoxTube dataset official repository☆68Updated last year
- Predicts the level of noise and reverberation on your audiofiles☆149Updated 11 months ago
- Spot the conversation: speaker diarisation in the wild☆138Updated 2 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- ☆17Updated 4 years ago
- Various speech datasets made available to the public☆117Updated 5 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 3 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆88Updated last month
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆71Updated 8 months ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆63Updated 2 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆86Updated 2 years ago
- Online streaming speaker change detection model in Pytorch☆39Updated 2 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆63Updated last month
- ☆16Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year