Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM
☆37Oct 6, 2023Updated 2 years ago
Alternatives and similar repositories for whisper-finetune-vietnamese
Users that are interested in whisper-finetune-vietnamese are comparing it to the libraries listed below
Sorting:
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Oct 5, 2023Updated 2 years ago
- VietASR - Vietnamese Automatic Speech Recognition☆164Oct 29, 2024Updated last year
- ☆16Jun 13, 2022Updated 3 years ago
- ☆32Dec 4, 2022Updated 3 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn☆14Aug 13, 2024Updated last year
- ☆13Oct 27, 2021Updated 4 years ago
- Use LoRA technique to improve training Large Language Model☆13Jul 25, 2023Updated 2 years ago
- Quy Nhon AI Hackathon 2022 - Challenge 2: Review Analytics - Top 1 Solution☆10Sep 21, 2022Updated 3 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- Fine-Tune Whisper with Transformers and PEFT☆58Nov 4, 2023Updated 2 years ago
- Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace☆13Nov 29, 2022Updated 3 years ago
- ☆15Jan 10, 2023Updated 3 years ago
- This code repo is in reference to the Medium Article for setting up Kaldi on AWS☆12Nov 3, 2019Updated 6 years ago
- Deep Learning and Computer Vision application for chessboard detection and chess pieces classification.☆14Apr 13, 2021Updated 4 years ago
- Scene text vietnamese☆19May 18, 2022Updated 3 years ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆361May 23, 2023Updated 2 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- Solution for MC_OCR competition☆95Mar 7, 2023Updated 3 years ago
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- Official repo for the Vietnam-Celeb dataset☆26Aug 27, 2023Updated 2 years ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆114Jun 10, 2023Updated 2 years ago
- ☆50Oct 12, 2022Updated 3 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.☆18Nov 13, 2021Updated 4 years ago
- Fine-tuning Vietnamese Text-to-speech model (VITS)☆55Mar 18, 2025Updated 11 months ago
- The paper list of multilingual pre-trained models (Continual Updated).☆24Jun 18, 2024Updated last year
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆42Feb 8, 2020Updated 6 years ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆22Jun 5, 2025Updated 9 months ago
- Minimize kaldi nnet3 chain decoder☆45Jan 10, 2020Updated 6 years ago
- Key information extraction from invoice document with Graph Convolution Network☆55May 12, 2023Updated 2 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆51Dec 7, 2021Updated 4 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- [WIP] Scripts for fine-tuning Whisper☆222May 29, 2023Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago
- Long audio alignment using Kaldi☆23Apr 22, 2021Updated 4 years ago
- The RWTH ASR Toolkit.☆58Updated this week
- ☆49Apr 28, 2023Updated 2 years ago
- ☆25Aug 28, 2024Updated last year