ayaka14732 / gpt4-cantonese-english-translator
A Cantonese-English translator based on prompt engineering
☆11Updated last year
Related projects
Alternatives and complementary repositories for gpt4-cantonese-english-translator
- An English-to-Cantonese machine translation model☆49Updated 7 months ago
- BERT Tokenizer with vocabulary tailored for Cantonese☆20Updated 2 years ago
- cantonese-mandarin unsupervised neural translation for sw project☆24Updated last year
- 粵文語料篩選器 Cantonese text filter☆33Updated 2 months ago
- JAX implementation of the bart-base model☆29Updated last year
- A Python script for scraping LIHKG☆30Updated 2 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆85Updated 3 years ago
- Upstream word list repository for rime-cantonese☆27Updated 2 months ago
- Online BaseHangul Encoder And Decoder☆12Updated last year
- Cantonese dialogue corpus☆24Updated last year
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆46Updated 8 months ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆14Updated 2 years ago
- 粵語拼音自動標註工具 Cantonese Pronunciation Automatic Labeling Tool☆63Updated last month
- A frequency lexicon for Hong Kong Cantonese☆20Updated 4 years ago
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆22Updated last month
- TrAVis: Visualise BERT attention in your browser☆55Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆15Updated last week
- Transformers for Cantonese☆54Updated 4 years ago
- Cantonese segmentation tool 粵語分詞工具☆29Updated 4 years ago
- Evaluating LLMs with Dynamic Data☆72Updated last week
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆83Updated last month
- OpenAI Whisper Prompt Examples☆48Updated last year
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆34Updated 3 years ago
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…☆12Updated 3 years ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆86Updated last year
- Cantonese romanization conversion table☆31Updated 6 months ago