google-research / mozolm
MozoLM: A language model (LM) serving library
☆44Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for mozolm
- Finite-state script normalization and processing utilities☆38Updated this week
- Conversational AI Benchmark.☆65Updated last year
- Sentence Embedding as a Service☆14Updated last year
- Scripts supporting the development and serving the Roots Search Tool - https://hf.co/spaces/bigscience-data/roots-search☆10Updated last year
- Read-only unofficial mirror of OpenFst☆43Updated 2 years ago
- ☆28Updated last year
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- ☆86Updated 2 years ago
- GPT-jax based on the official huggingface library☆13Updated 3 years ago
- A JAX library for building lattice-based speech transducer models☆40Updated last month
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 3 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 3 years ago
- Standalone commandline CLI tool for compiling Triton kernels☆15Updated 2 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆18Updated last year
- Implementing activation functions from scratch in Tensorflow.☆36Updated 2 years ago
- Cortex-compatible model server for Python and TensorFlow☆17Updated last year
- Generative Retrieval Transformer☆29Updated last year
- Efficiently computing & storing token n-grams from large corpora☆15Updated last month
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆20Updated 4 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Read-only unofficial mirror of the OpenGrm NGram Library☆8Updated 5 years ago
- The collection of bulding blocks building fine-tunable metric learning models☆32Updated last month
- ☆51Updated 4 years ago
- Tutorial on how to convert machine learned models into ONNX☆15Updated last year
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 3 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated 11 months ago
- Development repository for Integrated Speech Corpus Analaysis (ISCAN)☆9Updated 2 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆34Updated last year
- ☆74Updated 3 years ago