1-800-BAD-CODE / punctuators
Inference package for punctuation, true-casing, and sentence boundary detection
☆25Updated 10 months ago
Alternatives and similar repositories for punctuators:
Users who are interested in punctuators are comparing it to the libraries listed below:
- Simple Diarization model☆47Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 6 months ago
- one script for xls-r/xlsr/whisper fine-tuning☆41Updated last year
- Official code for Wav2Seq☆96Updated 2 years ago
- ☆77Updated last year
- The case study and multilingual performance of ICASSP submission☆23Updated 2 years ago
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.☆69Updated last month
- ☆87Updated last week
- ☆56Updated 2 years ago
- ☆19Updated last year
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Updated last year
- Putting flows on top of neural transducers for better TTS☆62Updated 2 weeks ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆18Updated last year
- ☆56Updated 2 years ago
- ☆18Updated 4 years ago
- Onnx wrapper for espnet inference model☆162Updated 6 months ago
- Convert English text from written expressions into spoken forms☆25Updated 2 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 3 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆71Updated 7 months ago
- Collection of scripts from mHuBERT-147.☆24Updated 5 months ago
- ☆56Updated 9 months ago
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆140Updated last year
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆56Updated last week
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆158Updated 2 weeks ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆31Updated 8 months ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆45Updated 3 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆100Updated 2 months ago
- Official Code for ParrotTTS☆48Updated 6 months ago
- Official implementation of MelHuBERT☆65Updated 5 months ago
- 56 language, 1 model Multilingual ASR☆25Updated 3 years ago