google-research / mozolm
MozoLM: A language model (LM) serving library
☆42Updated 4 months ago
Related projects:
- Finite-state script normalization and processing utilities☆36Updated this week
- Read-only unofficial mirror of OpenFst☆41Updated 2 years ago
- Coqui Inference Engine☆38Updated 3 years ago
- Read-only unofficial mirror of the OpenGrm NGram Library☆8Updated 5 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Conversational AI Benchmark.☆63Updated last year
- A JAX library for building lattice-based speech transducer models☆39Updated 5 months ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 3 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆30Updated 6 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated last year
- aiXplain enables python programmers to add AI functions to their software.☆24Updated last week
- Barista is an open-source framework for concurrent speech processing.☆36Updated 10 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 6 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆34Updated last year
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 5 years ago
- A database of number names for 186 languages, locales, and scripts☆66Updated last year
- A collection of building blocks for building fine-tunable metric learning models☆31Updated 2 months ago
- Speech in Flax/JAX☆15Updated 2 years ago
- A C++ library implementing fast language model estimation using the 1-Sort algorithm.☆17Updated last year
- Simple text to phonemes converter for multiple languages☆21Updated last year
- A GPU language model, based on btree backed tries.☆29Updated 6 years ago
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.☆64Updated 4 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆41Updated 3 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆60Updated 3 years ago
- Experiments with Hugging Face 🔬 🤗☆45Updated last month
- A collection of useful tools for handling speech recognition data☆30Updated last year
- docker for HF wav2vec2-sprint☆12Updated 3 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 5 years ago