Bud500: A Comprehensive Vietnamese ASR Dataset
☆69Oct 10, 2025Updated 4 months ago
Alternatives and similar repositories for Bud500
Users that are interested in Bud500 are comparing it to the libraries listed below
Sorting:
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26May 14, 2024Updated last year
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆23Jul 1, 2024Updated last year
- Building a Machine Learning Library from scratch using Python3, based on SOTA library Scikit-learn☆15Jan 20, 2023Updated 3 years ago
- MLOps for Image Caption Generator.☆25Nov 27, 2023Updated 2 years ago
- Short experiment with Deep Q-Learning + KAN to play Flappy Bird.☆19Aug 6, 2024Updated last year
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆105Sep 3, 2021Updated 4 years ago
- Dự án bao gồm: 1. Xây dựng bộ dữ Instructions Vietnamese (chất lượng, nhiều, và đa dạng). 2.LLM Training, Finetuning, Evaluating & Testin…☆277Sep 1, 2025Updated 6 months ago
- ☆78May 4, 2024Updated last year
- (WIP) Parallel inference for black-forest-labs' FLUX model.☆19Nov 18, 2024Updated last year
- Transformation spoken text to written text☆31May 14, 2024Updated last year
- ☆13Apr 12, 2025Updated 10 months ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆103Jun 21, 2024Updated last year
- A collection of Vietnamese Natural Language Processing resources.☆308Oct 28, 2025Updated 4 months ago
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t…☆370Sep 5, 2022Updated 3 years ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆66Jan 1, 2025Updated last year
- PhoGPT: Generative Pre-training for Vietnamese (2023)☆798Nov 12, 2024Updated last year
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆68Jan 22, 2024Updated 2 years ago
- PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)☆195Nov 12, 2024Updated last year
- ☆17Sep 30, 2023Updated 2 years ago
- This is a project about Optical Character Recognition (OCR) in Vietnamese texts by using PaddleOCR and VietOCR.☆27Mar 19, 2024Updated last year
- ☆57Feb 25, 2026Updated last week
- A simple YOLOv10 streamlit web demo☆24May 31, 2024Updated last year
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆22Jun 5, 2025Updated 9 months ago
- wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech☆95Jul 9, 2025Updated 7 months ago
- PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)☆773Jul 23, 2024Updated last year
- VietASR - Vietnamese Automatic Speech Recognition☆164Oct 29, 2024Updated last year
- Python Vietnamese Core NLP Toolkit☆272Sep 26, 2024Updated last year
- ☆24Dec 18, 2025Updated 2 months ago
- Corpus tiếng việt☆385Oct 3, 2025Updated 5 months ago
- ☆26Jan 28, 2024Updated 2 years ago
- Another pytorch implementation of Faster RCNN.☆24Feb 17, 2019Updated 7 years ago
- Question Answering in Vietnamese. In a nutshell, this project helps us answer a Question of a given Context in Vietnamese. [UPDATED] This…☆26Nov 17, 2022Updated 3 years ago
- Vietnamese self-supervised Wav2vec2 model☆61Nov 5, 2022Updated 3 years ago
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆29Apr 7, 2023Updated 2 years ago
- Solution for MC_OCR competition☆95Mar 7, 2023Updated 3 years ago
- ☆67Apr 12, 2024Updated last year
- To simplify and streamline LLM operations, empowering developers and organizations to harness the full potential of large language models…☆131Jan 21, 2025Updated last year
- [LREC-COLING 2024 (Oral), Interspeech 2024 (Oral), NAACL 2025, ACL 2025, EMNLP 2025] A Series of Multilingual Multitask Medical Speech Pr…☆374Dec 31, 2025Updated 2 months ago
- Batch generate images with Bing Image Creator (powered by DALL-E 3) using Seleniumbase.☆36Mar 16, 2024Updated last year