hardmaru / cantonese-listLinks
List of 4000 Chinese characters sorted by historical usage frequency, with Cantonese yale romanization and definition
☆14Updated 3 years ago
Alternatives and similar repositories for cantonese-list
Users that are interested in cantonese-list are comparing it to the libraries listed below
Sorting:
- c++ mosestokenizer☆18Updated last year
- Autoregressive transformer in JAX from scratch☆23Updated 3 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Updated last year
- A generative modelling toolkit for PyTorch.☆70Updated 4 years ago
- A tiny BERT for low-resource monolingual models☆31Updated 2 weeks ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆47Updated last year
- ☆58Updated 3 years ago
- A guide to building language technology in new languages.☆59Updated 3 years ago
- RWKV model implementation☆38Updated 2 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆51Updated last year
- A collection of utilities for handling IPA phones.☆26Updated 2 years ago
- A stateful pytree library for training neural networks.☆22Updated 4 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- AdaCat☆49Updated 3 years ago
- ☆28Updated 4 years ago
- A flexible sentence segmentation library using CRF model and regex rules☆31Updated 3 months ago
- Library for fast text representation and classification.☆31Updated 2 years ago
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆36Updated 10 months ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆38Updated 2 years ago
- Latent Diffusion Language Models☆70Updated 2 years ago
- My explorations into editing the knowledge and memories of an attention network☆35Updated 3 years ago
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆61Updated 3 years ago
- ☆32Updated 2 years ago
- Implementation of the GBST block from the Charformer paper, in Pytorch☆118Updated 4 years ago
- Datasets for turn-taking research☆17Updated 2 years ago
- AdamW optimizer for bfloat16 models in pytorch 🔥.☆39Updated last year
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆93Updated 2 years ago
- Code for the paper-"Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).☆61Updated 3 years ago
- ☆92Updated 3 years ago