We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSMEC and ViMMRC.
☆21Jun 19, 2021Updated 4 years ago
Alternatives and similar repositories for VietnameseDatasets
Users that are interested in VietnameseDatasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Oct 15, 2021Updated 4 years ago
- VIMQA dataset☆12Jul 6, 2022Updated 3 years ago
- A collection of Vietnamese Natural Language Processing resources.☆311Oct 28, 2025Updated 5 months ago
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆29Apr 7, 2023Updated 3 years ago
- A collection of research papers related to Natural Language Reasoning☆11May 27, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation of the DocLLM paper for Llama models.☆13Apr 6, 2025Updated last year
- MLOPs human pose estimation end-to-end.☆39Apr 23, 2024Updated last year
- Dự án bao gồm: 1. Xây dựng bộ dữ Instructions Vietnamese (chất lượng, nhiều, và đa dạng). 2.LLM Training, Finetuning, Evaluating & Testin…☆280Sep 1, 2025Updated 7 months ago
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- Sherpa-onnx-tts-stt source for homeassisstant addon with Kroko Onnx Streaming STT integration.☆28Dec 18, 2025Updated 3 months ago
- Phân loại văn bản Tiếng Việt sử dụng pretrained model - PhoBERT☆12Feb 1, 2021Updated 5 years ago
- Scripts to automate simple tasks throughout learning process at UET-VNU☆18Jun 8, 2021Updated 4 years ago
- ☆21Jun 13, 2019Updated 6 years ago
- Ensemble PhoBERT with FastText Embedding to improve performance on Vietnamese Sentiment Analysis tasks.☆16Jun 29, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆20Nov 19, 2023Updated 2 years ago
- Machine Reading Comprehension special for the Vietnamese language☆41Mar 13, 2022Updated 4 years ago
- Master thesis: Exploring bias in German NLG (GPT-3 & GerPT-2). Applies regard classification and bias mitigation triggers.☆16Sep 25, 2024Updated last year
- an image processing exercise, grade analysis tool for pdf docs from UET grade publication site☆12May 24, 2021Updated 4 years ago
- Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL2023)☆37Nov 25, 2023Updated 2 years ago
- RP - FO Project S2T1, DSAI HUST☆14Jun 7, 2022Updated 3 years ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆22Jun 5, 2025Updated 10 months ago
- ☆19Jun 28, 2022Updated 3 years ago
- This is sample source code for Reinforcement Learning Competition, hosted by FPT-Software (Hanoi, Vietnam). The game is Gold Miner.☆27Sep 25, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆27Feb 18, 2025Updated last year
- This repo provides Geometric LayoutLM for Vietnamese document and code for export to ONNX☆14Mar 3, 2024Updated 2 years ago
- Phần mềm nguồn mở giúp mỗi cá nhân trực tiếp sử dụng ChatGPT và hơn thế nữa ngay trên máy tính của mình.☆34Apr 5, 2023Updated 3 years ago
- Image Processing course 2019-2020Semester1☆14Dec 9, 2019Updated 6 years ago
- BFloat16 Fused Adam Operator for PyTorch☆19Nov 16, 2024Updated last year
- This repository holds the code for my master thesis entitles "The Association of Gender Bias with BERT - Measuring, Mitigating and Cross-…☆18Sep 19, 2022Updated 3 years ago
- Corpus tiếng việt☆383Oct 3, 2025Updated 6 months ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆26Jul 16, 2021Updated 4 years ago
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)☆49Jun 3, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- DataTalks.Club's Data Engineering Zoomcamp Project☆24Jul 14, 2022Updated 3 years ago
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- alm0n for UET's viewgrade☆16Feb 7, 2023Updated 3 years ago
- ☆10Jul 12, 2019Updated 6 years ago
- Seq2seq using LSTM with attention from Luong et al☆10Oct 2, 2018Updated 7 years ago
- ☆12Dec 22, 2024Updated last year
- Code for the DataPipes article☆15Jun 14, 2022Updated 3 years ago