hardmaru / cantonese-list

List of 4000 Chinese characters sorted by historical usage frequency, with Cantonese yale romanization and definition

☆14

Alternatives and similar repositories for cantonese-list:

Users that are interested in cantonese-list are comparing it to the libraries listed below

google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34Updated last year
lucidrains / memory-editable-transformer
My explorations into editing the knowledge and memories of an attention network
☆34Updated 2 years ago
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆26Updated 11 months ago
basusourya / mirostat
Code for the paper-"Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm" (https://arxiv.org/abs/2007.14966).
☆58Updated 3 years ago
arogozhnikov / adamw_bfloat16
AdamW optimizer for bfloat16 models in pytorch 🔥.
☆32Updated 9 months ago
EleutherAI / exploring-contrastive-topology
☆15Updated 2 years ago
nostalgebraist / improved-diffusion
Text-writing denoising diffusion (and much more)
☆30Updated last year
ethansmith2000 / TransformerExperiments
☆19Updated this week
mingruimingrui / fast-mosestokenizer
c++ mosestokenizer
☆17Updated last year
AeroScripts / HiddenEngrams
Hidden Engrams: Long Term Memory for Transformer Model Inference
☆35Updated 3 years ago
EleutherAI / rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆32Updated 10 months ago
sanchit-gandhi / seq2seq-speech
Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.
☆35Updated 2 years ago
ClashLuke / tpucare
Automatically take good care of your preemptible TPUs
☆36Updated last year
alvarobartt / safejax
Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`
☆44Updated 9 months ago
cccwam / rc2020_electra
ML Reproducibility Challenge 2020: Electra reimplementation using PyTorch and Transformers
☆12Updated 3 years ago
ColinQiyangLi / AdaCat
AdaCat
☆49Updated 2 years ago
google-research / precondition
☆31Updated last week
UKPLab / on-emergence
Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning
☆33Updated 2 months ago
srush / torch-golf
Silly twitter torch implementations.
☆46Updated 2 years ago
google-research-datasets / Disfl-QA
A Benchmark Dataset for Understanding Disfluencies in Question Answering
☆62Updated 3 years ago
ltgoslo / ltg-bert
LTG-Bert
☆31Updated last year
Popgun-Labs / PopGen
A generative modelling toolkit for PyTorch.
☆70Updated 3 years ago
neulab / newlang-tech
A guide to building language technology in new languages.
☆58Updated 3 years ago
ChenghaoMou / embeddings
zero-vocab or low-vocab embeddings
☆18Updated 2 years ago
harvardnlp / hmm-lm
☆42Updated 3 years ago
LAION-AI / Anh
Anh - LAION's multilingual assistant datasets and models
☆27Updated last year
zphang / minimal-opt
☆66Updated 2 years ago
gsarti / t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
☆58Updated 2 years ago
sustcsonglin / gated_linear_attention_layer
☆33Updated last year
codekansas / rwkv
RWKV model implementation
☆37Updated last year