We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSMEC and ViMMRC.
☆21Jun 19, 2021Updated 4 years ago
Alternatives and similar repositories for VietnameseDatasets
Users that are interested in VietnameseDatasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Machine Reading Comprehension has attracted significant interest in research on natural language understanding, and large-scale datasets …☆10Aug 14, 2021Updated 4 years ago
- ☆16Oct 15, 2021Updated 4 years ago
- VIMQA dataset☆13Jul 6, 2022Updated 3 years ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆114Jun 10, 2023Updated 2 years ago
- A collection of Vietnamese Natural Language Processing resources.☆309Oct 28, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆29Apr 7, 2023Updated 2 years ago
- DS310.M11 - Xử Lý Ngôn Ngữ Tự Nhiên Cho Khoa Học Dữ Liệu☆17Mar 4, 2022Updated 4 years ago
- A collection of research papers related to Natural Language Reasoning☆11May 27, 2022Updated 3 years ago
- ☆15Jun 27, 2023Updated 2 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- MLOPs human pose estimation end-to-end.☆39Apr 23, 2024Updated last year
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- Sherpa-onnx-tts-stt source for homeassisstant addon with Kroko Onnx Streaming STT integration.☆28Dec 18, 2025Updated 3 months ago
- Phân loại văn bản Tiếng Việt sử dụng pretrained model - PhoBERT☆12Feb 1, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆21Jun 13, 2019Updated 6 years ago
- Ensemble PhoBERT with FastText Embedding to improve performance on Vietnamese Sentiment Analysis tasks.☆16Jun 29, 2023Updated 2 years ago
- ☆20Nov 19, 2023Updated 2 years ago
- Machine Reading Comprehension special for the Vietnamese language☆41Mar 13, 2022Updated 4 years ago
- Master thesis: Exploring bias in German NLG (GPT-3 & GerPT-2). Applies regard classification and bias mitigation triggers.☆16Sep 25, 2024Updated last year
- an image processing exercise, grade analysis tool for pdf docs from UET grade publication site☆12May 24, 2021Updated 4 years ago
- Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL2023)☆37Nov 25, 2023Updated 2 years ago
- ☆17Jul 10, 2022Updated 3 years ago
- RP - FO Project S2T1, DSAI HUST☆14Jun 7, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆22Jun 5, 2025Updated 9 months ago
- ☆19Jun 28, 2022Updated 3 years ago
- ☆48Sep 3, 2025Updated 6 months ago
- This is sample source code for Reinforcement Learning Competition, hosted by FPT-Software (Hanoi, Vietnam). The game is Gold Miner.☆27Sep 25, 2020Updated 5 years ago
- ☆28Feb 18, 2025Updated last year
- TP - AI Project S2T1, DSAI HUST☆19Jun 7, 2022Updated 3 years ago
- Phần mềm nguồn mở giúp mỗi cá nhân trực tiếp sử dụng ChatGPT và hơn thế nữa ngay trên máy tính của mình.☆34Apr 5, 2023Updated 2 years ago
- The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus t…☆23Aug 2, 2021Updated 4 years ago
- Image Processing course 2019-2020Semester1☆14Dec 9, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- BFloat16 Fused Adam Operator for PyTorch☆17Nov 16, 2024Updated last year
- ☆23Mar 20, 2024Updated 2 years ago
- This repository holds the code for my master thesis entitles "The Association of Gender Bias with BERT - Measuring, Mitigating and Cross-…☆18Sep 19, 2022Updated 3 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆26Jul 16, 2021Updated 4 years ago
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)☆49Jun 3, 2025Updated 9 months ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆24Jul 14, 2022Updated 3 years ago
- Custom ML tracking experiment and debugging tools.☆15Aug 2, 2022Updated 3 years ago