Fine-tuning Wav2Vec2.0 on Common Voice (zh-HK)
☆16May 8, 2022Updated 4 years ago
Alternatives and similar repositories for CantoASR
Users that are interested in CantoASR are comparing it to the libraries listed below.
- ASR project with pytorch-lightning☆20Mar 21, 2025Updated last year
- Free, open-source, offline, safe and secure AI Cantonese transcription, in your device.☆19Nov 7, 2025Updated 6 months ago
- wav2vec2 asr with transformers☆16Oct 26, 2021Updated 4 years ago
- Character-level Recurrent Neural Network Language Model (rnnlm) implemented in PyTorch.☆12Oct 4, 2020Updated 5 years ago
- ☆12Feb 9, 2021Updated 5 years ago
- ☆12Aug 9, 2021Updated 4 years ago
- cantonese-mandarin unsupervised neural translation for sw project☆29May 2, 2023Updated 3 years ago
- BERT Tokenizer with vocabulary tailored for Cantonese☆23Oct 27, 2022Updated 3 years ago
- Aligntune: A Modular Toolkit for Post Training Alignment of LLMs☆36Apr 29, 2026Updated 3 weeks ago
- First neural GPT aligned with text and speech. Join us in building a better foundation model for the neural modality.☆14Oct 30, 2024Updated last year
- ☆16Aug 1, 2025Updated 9 months ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16May 9, 2021Updated 5 years ago
- Speech recognition: papers and frontier research☆53Jan 8, 2022Updated 4 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆40Dec 30, 2020Updated 5 years ago
- PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper☆16Mar 4, 2022Updated 4 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- Conformer RNN-Transducer☆14May 25, 2022Updated 4 years ago
- View and compare Bible Translations in an innovative interlinear format. Run on Windows or Web.☆14May 15, 2026Updated last week
- ☆16Nov 11, 2025Updated 6 months ago
- ☆17Nov 30, 2021Updated 4 years ago
- FunAudioLLM homepage☆17Dec 11, 2024Updated last year
- Leveraging Social Influence based on Users Activity Centers for Point-of-Interest Recommendation☆13Nov 21, 2022Updated 3 years ago
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆27Nov 14, 2024Updated last year
- Cantonese Selfish Project 廣東話自肥企劃 at PYCON HK 2021☆15Feb 16, 2022Updated 4 years ago
- ☆25Jun 19, 2025Updated 11 months ago
- ICASSP2026 HumDial Challenge☆45Dec 13, 2025Updated 5 months ago
- Road crack detection project based on NestedUnet model☆22Jan 22, 2022Updated 4 years ago
- Implementations of CTC, Speech-Transformer and CIF for end-to-end speech recognition with PyTorch☆23Jul 28, 2020Updated 5 years ago
- An English-to-Cantonese machine translation model☆55Mar 26, 2025Updated last year
- Implementation of an RNN transducer☆23Jun 25, 2019Updated 6 years ago
- GMM-based isolated-word speech recognition system for the digits 0-9☆10Sep 29, 2020Updated 5 years ago
- Transformers for Cantonese☆58Oct 24, 2020Updated 5 years ago
- ☆15Aug 30, 2022Updated 3 years ago
- A Python script for scraping LIHKG☆32Mar 7, 2022Updated 4 years ago
- A simple 3d sound head related transfer function (HRTF) implementation.☆23Dec 25, 2015Updated 10 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Feb 10, 2022Updated 4 years ago
- A Jekyll theme based off of mdbook☆30Sep 28, 2020Updated 5 years ago
- A merged version of multiple open-source German speech datasets.☆34May 3, 2024Updated 2 years ago