We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSMEC and ViMMRC.
☆20Jun 19, 2021Updated 4 years ago
Alternatives and similar repositories for VietnameseDatasets
Users that are interested in VietnameseDatasets are comparing it to the libraries listed below
Sorting:
- Machine Reading Comprehension has attracted significant interest in research on natural language understanding, and large-scale datasets …☆10Aug 14, 2021Updated 4 years ago
- ☆19Jun 28, 2022Updated 3 years ago
- ☆16Oct 15, 2021Updated 4 years ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆22Jun 5, 2025Updated 9 months ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆26Jul 16, 2021Updated 4 years ago
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆29Apr 7, 2023Updated 2 years ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆114Jun 10, 2023Updated 2 years ago
- ASCEND Chinese-English code-switching dataset☆30Jul 12, 2022Updated 3 years ago
- ☆30Jul 21, 2022Updated 3 years ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- Phần mềm nguồn mở giúp mỗi cá nhân trực tiếp sử dụng ChatGPT và hơn thế nữa ngay trên máy tính của mình.☆34Apr 5, 2023Updated 2 years ago
- MLOPs human pose estimation end-to-end.☆39Apr 23, 2024Updated last year
- ViStreamASR - Real-Time Vietnamese Speech Recognition☆52Jul 12, 2025Updated 7 months ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Cheatsheet for slurm command lines☆10Apr 9, 2023Updated 2 years ago
- Code for the paper "RIR-in-a-Box : Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation" presented at Interspeech 20…☆15Sep 1, 2024Updated last year
- Enhancing Domain Adaptation through Prompt Gradient Alignment (NeurIPS 2024)☆14Jun 16, 2024Updated last year
- ☆11Aug 11, 2023Updated 2 years ago
- This repo builds an end-to-end deep learning application that supports speech recognition system. It's simple to use and understand☆38May 23, 2023Updated 2 years ago
- Russian phonetical transcription☆11Nov 19, 2025Updated 3 months ago
- KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fas…☆23Updated this week
- ☆13Oct 9, 2025Updated 4 months ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- Whisper finetuning☆16Apr 9, 2025Updated 10 months ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated 10 months ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Oct 30, 2024Updated last year
- A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Vosk Speech Recognition API) and TRANSLATED SUBTITLE FILE…☆11May 5, 2024Updated last year
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- A pakage for crawling audio from Youtube☆42Aug 8, 2023Updated 2 years ago
- Dataset Release for Intent Classification from Speech☆48Feb 23, 2025Updated last year
- Machine Reading Comprehension special for the Vietnamese language☆41Mar 13, 2022Updated 3 years ago
- Dự án bao gồm: 1. Xây dựng bộ dữ Instructions Vietnamese (chất lượng, nhiều, và đa dạng). 2.LLM Training, Finetuning, Evaluating & Testin…☆277Sep 1, 2025Updated 6 months ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- ☆11Jun 14, 2024Updated last year
- Docker for building an environment for Dutch online and offline ASR.☆12Feb 2, 2021Updated 5 years ago
- ☆13Oct 3, 2025Updated 5 months ago