ayaka14732 / gpt4-cantonese-english-translator
A Cantonese-English translator based on prompt engineering
☆11Updated last year
Related projects ⓘ
Alternatives and complementary repositories for gpt4-cantonese-english-translator
- An English-to-Cantonese machine translation model☆49Updated 7 months ago
- cantonese-mandarin unsupervised neural translation for sw project☆24Updated last year
- BERT Tokenizer with vocabulary tailored for Cantonese☆19Updated 2 years ago
- JAX implementation of the bart-base model☆29Updated last year
- 粵文語料篩選器 Cantonese text filter☆33Updated 2 months ago
- TrAVis: Visualise BERT attention in your browser☆55Updated last year
- Online BaseHangul Encoder And Decoder☆12Updated last year
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆14Updated 2 years ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆44Updated 7 months ago
- Transformers for Cantonese☆54Updated 4 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆85Updated 3 years ago
- Small Model Is All You Need - NTU SC4001 Neural Network & Deep Learning Project☆16Updated last year
- 粵語拼音自動標註工具 Cantonese Pronunciation Automatic Labeling Tool☆62Updated last month
- Cantonese segmentation tool 粵語分詞工具☆29Updated 4 years ago
- A Python script for scraping LIHKG☆30Updated 2 years ago
- fastText vectors created from Hong Kong data.☆21Updated 4 years ago
- The official code for our EMNLP 2022 long paper [Breaking the Representation Bottleneck of Chinese Characters: Neural Machine Translation…☆22Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆33Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆25Updated last year
- ☆27Updated last year
- ROUGE score calculator with traditional chinese word segmentation☆9Updated 3 years ago
- A frequency lexicon for Hong Kong Cantonese☆20Updated 4 years ago
- 粵語對話語料☆24Updated last year
- one script for xls-r/xlsr/whisper fine-tuning☆39Updated last year
- ☆22Updated 2 months ago
- A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations…☆13Updated 7 months ago
- Taiwanese Hokkien Transliterator and Tokeniser☆24Updated 2 months ago
- Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generation☆72Updated 11 months ago
- Say goodbye to long and boring videos 👋☆36Updated 2 years ago
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆22Updated 3 weeks ago