google-research / mozolm
MozoLM: A language model (LM) serving library
☆44Updated 2 months ago
Alternatives and similar repositories for mozolm:
Users that are interested in mozolm are comparing it to the libraries listed below
- Finite-state script normalization and processing utilities☆38Updated this week
- Conversational AI Benchmark.☆65Updated last year
- ☆86Updated 2 years ago
- Scripts supporting the development and serving the Roots Search Tool - https://hf.co/spaces/bigscience-data/roots-search☆10Updated last year
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 3 years ago
- Read-only unofficial mirror of OpenFst☆43Updated 2 years ago
- Sentence Embedding as a Service☆14Updated last year
- **ARCHIVED** Filesystem interface to 🤗 Hub☆57Updated last year
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12Updated 7 months ago
- Read-only unofficial mirror of the OpenGrm NGram Library☆8Updated 5 years ago
- The collection of bulding blocks building fine-tunable metric learning models☆32Updated last week
- Tutorial on how to convert machine learned models into ONNX☆16Updated last year
- A JAX library for building lattice-based speech transducer models☆41Updated last month
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆18Updated last year
- A Streamlit app to add structured tags to a dataset card☆22Updated 2 years ago
- Coqui Inference Engine☆38Updated 3 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- ☆74Updated 3 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆34Updated last year
- Development repository for Integrated Speech Corpus Analaysis (ISCAN)☆9Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆34Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated last year
- Efficiently computing & storing token n-grams from large corpora☆17Updated 3 months ago
- benchmarking some transformer deployments☆26Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆52Updated 11 months ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- 3rd party dependencies for DALI project☆10Updated this week
- Seed Machine Translation Data☆30Updated 2 months ago