Fine-tuning Wav2Vec2.0 on Common Voice (zh-HK)
☆16May 8, 2022Updated 4 years ago
Alternatives and similar repositories for CantoASR
Users that are interested in CantoASR are comparing it to the libraries listed below.
- Free, open-source, offline, safe and secure AI Cantonese transcription, in your device.☆19Nov 7, 2025Updated 7 months ago
- wav2vec2 asr with transformers☆16Oct 26, 2021Updated 4 years ago
- ASR, End-to-End, end2end, Speech Recognition, end-to-end speech recognition☆12Oct 25, 2020Updated 5 years ago
- Character-level Recurrent Neural Network Language Model (rnnlm) implemented in PyTorch.☆12Oct 4, 2020Updated 5 years ago
- ☆12Feb 9, 2021Updated 5 years ago
- ☆10Aug 28, 2019Updated 6 years ago
- ☆12Aug 9, 2021Updated 4 years ago
- cantonese-mandarin unsupervised neural translation for sw project☆29May 2, 2023Updated 3 years ago
- BERT Tokenizer with vocabulary tailored for Cantonese☆23Oct 27, 2022Updated 3 years ago
- ☆13Sep 23, 2025Updated 8 months ago
- code for Multisample-based Contrastive Loss for Top-k Recommendation (IEEE TMM)☆10Nov 23, 2022Updated 3 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Jun 5, 2026Updated last week
- ☆16Aug 1, 2025Updated 10 months ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16May 9, 2021Updated 5 years ago
- Speech recognition, papers, state of the art☆53Jan 8, 2022Updated 4 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆41Dec 30, 2020Updated 5 years ago
- PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper☆16Mar 4, 2022Updated 4 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- Add n-gram and large language model (LLM) support to Whisper models.☆43May 6, 2025Updated last year
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- This project focuses on the classification of animal sounds using deep learning. The core idea is to utilize audio processing techniques …☆10Dec 3, 2024Updated last year
- The Cantonese Wordnet☆14Dec 4, 2023Updated 2 years ago
- Conformer RNN-Transducer☆14May 25, 2022Updated 4 years ago
- ☆16Nov 11, 2025Updated 7 months ago
- FunAudioLLM homepage☆17Dec 11, 2024Updated last year
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆27Nov 14, 2024Updated last year
- ICASSP2026 HumDial Challenge☆47May 28, 2026Updated 2 weeks ago
- Implementations of CTC, Speech-Transformer and CIF for end-to-end speech recognition with PyTorch☆23Jul 28, 2020Updated 5 years ago
- ☆41May 15, 2023Updated 3 years ago
- An English-to-Cantonese machine translation model☆55Mar 26, 2025Updated last year
- Implementation of the RNN transducer☆23Jun 25, 2019Updated 6 years ago
- GMM-based isolated-word speech recognition system for the digits 0-9☆10Sep 29, 2020Updated 5 years ago
- Transformers for Cantonese☆58Oct 24, 2020Updated 5 years ago
- ☆15Aug 30, 2022Updated 3 years ago
- A simple 3d sound head related transfer function (HRTF) implementation.☆23Dec 25, 2015Updated 10 years ago
- A merged version of multiple open-source German speech datasets.☆34May 3, 2024Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 7 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago