mush42 / hareef
state-of-the-art models for diacritics restoration for Arabic language
☆10Updated 9 months ago
Alternatives and similar repositories for hareef:
Users that are interested in hareef are comparing it to the libraries listed below
- Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models☆24Updated 2 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆27Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Updated 4 months ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- Simple PyTorch Denoisers for Waveform Audio☆34Updated 2 months ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆50Updated 6 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆48Updated last week
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆22Updated 2 years ago
- Unofficial implementation of wavenext vocoder☆42Updated 5 months ago
- ☆21Updated 6 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆40Updated last year
- ☆48Updated 3 months ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆24Updated 4 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆18Updated 3 months ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆49Updated 9 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆74Updated 3 years ago
- Workflow for forced alignment between languages☆17Updated last year
- Adaptive Vocoder for Custom Voice☆59Updated 2 years ago
- ☆37Updated 10 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 8 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆14Updated 2 weeks ago
- Machine learning speaker characteristics☆33Updated last week
- Official Repository For VoxBlink2☆62Updated 6 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- Colab notebooks for Next-gen Kaldi☆26Updated last week
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆62Updated 11 months ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆29Updated 2 months ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆24Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year
- The VoxTube dataset official repository☆67Updated last year