Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM
☆38Oct 6, 2023Updated 2 years ago
Alternatives and similar repositories for whisper-finetune-vietnamese
Users that are interested in whisper-finetune-vietnamese are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VietASR - Vietnamese Automatic Speech Recognition☆166Updated this week
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Oct 5, 2023Updated 2 years ago
- python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn☆14Aug 13, 2024Updated last year
- ☆15Jan 10, 2023Updated 3 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Scene text vietnamese☆19May 18, 2022Updated 3 years ago
- ☆32Dec 4, 2022Updated 3 years ago
- Fine-Tune Whisper with Transformers and PEFT☆59Nov 4, 2023Updated 2 years ago
- DaisyKit is an easy AI toolkit with face mask detection, pose detection, background matting, barcode detection, face recognition and more…☆110Oct 24, 2023Updated 2 years ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆361May 23, 2023Updated 2 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- Deep Learning and Computer Vision application for chessboard detection and chess pieces classification.☆14Apr 13, 2021Updated 4 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- [WIP] Scripts for fine-tuning Whisper☆221May 29, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A tutorial on how to train RNN-T from scratch with Whisper encoder☆12Mar 11, 2025Updated last year
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆114Jun 10, 2023Updated 2 years ago
- ☆50Oct 12, 2022Updated 3 years ago
- Python binding of bark.cpp via Ctypes☆11Jan 1, 2025Updated last year
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆78Jun 8, 2025Updated 9 months ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆51Dec 7, 2021Updated 4 years ago
- eKYC (Electronic Know Your Customer) is a project designed to electronically verify the identity of customers☆52Dec 7, 2024Updated last year
- Use LoRA technique to improve training Large Language Model☆13Jul 25, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Vietnamese speech recognition using Wavenet☆73Feb 2, 2023Updated 3 years ago
- Pre-trained grapheme-to-phoneme (G2P) models☆26Jul 27, 2021Updated 4 years ago
- This code repo is in reference to the Medium Article for setting up Kaldi on AWS☆12Nov 3, 2019Updated 6 years ago
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.☆16Nov 14, 2020Updated 5 years ago
- ☆26Jan 28, 2024Updated 2 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- ☆14Oct 7, 2024Updated last year
- Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.☆121Dec 5, 2023Updated 2 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Fine-tuning Vietnamese Text-to-speech model (VITS)☆57Mar 18, 2025Updated last year
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- EmbedRank implemented in Python.☆15Jun 17, 2024Updated last year
- A ChatGPT-like application built with Streamlit for interactive conversation with OpenAI's GPT-3.5 model.☆17Aug 8, 2024Updated last year
- Vietnamese song lyric alignment framework☆68Dec 11, 2022Updated 3 years ago
- Keyphrase Extraction Review☆14Dec 17, 2025Updated 3 months ago
- 🔥 Your private task assistant with GPT 🔥 - Ask questions about your documents.☆161Oct 7, 2024Updated last year