hardmaru / cantonese-list
List of 4000 Chinese characters sorted by historical usage frequency, with Cantonese yale romanization and definition
☆14Updated 2 years ago
Alternatives and similar repositories for cantonese-list:
Users that are interested in cantonese-list are comparing it to the libraries listed below
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- My explorations into editing the knowledge and memories of an attention network☆34Updated 2 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆26Updated 11 months ago
- Code for the paper-"Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).☆58Updated 3 years ago
- AdamW optimizer for bfloat16 models in pytorch 🔥.☆32Updated 9 months ago
- ☆15Updated 2 years ago
- Text-writing denoising diffusion (and much more)☆30Updated last year
- ☆19Updated this week
- c++ mosestokenizer☆17Updated last year
- Hidden Engrams: Long Term Memory for Transformer Model Inference☆35Updated 3 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 10 months ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆35Updated 2 years ago
- Automatically take good care of your preemptible TPUs☆36Updated last year
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆44Updated 9 months ago
- ML Reproducibility Challenge 2020: Electra reimplementation using PyTorch and Transformers☆12Updated 3 years ago
- AdaCat☆49Updated 2 years ago
- ☆31Updated last week
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Updated 2 months ago
- Silly twitter torch implementations.☆46Updated 2 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆62Updated 3 years ago
- LTG-Bert☆31Updated last year
- A generative modelling toolkit for PyTorch.☆70Updated 3 years ago
- A guide to building language technology in new languages.☆58Updated 3 years ago
- zero-vocab or low-vocab embeddings☆18Updated 2 years ago
- ☆42Updated 3 years ago
- Anh - LAION's multilingual assistant datasets and models☆27Updated last year
- ☆66Updated 2 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- ☆33Updated last year
- RWKV model implementation☆37Updated last year