Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn
☆29Apr 7, 2023Updated 2 years ago
Alternatives and similar repositories for vi
Users that are interested in vi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Dec 20, 2023Updated 2 years ago
- Transformation spoken text to written text☆31May 14, 2024Updated last year
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆114Jun 10, 2023Updated 2 years ago
- Machine Reading Comprehension special for the Vietnamese language☆41Mar 13, 2022Updated 4 years ago
- Vietnamese text data crawler scripts for various sites (including Youtube, Facebook, 4rum, news, ...)☆75Oct 25, 2022Updated 3 years ago
- Cải thiện Elasticsearch trong bài toán semantic search sử dụng phương pháp Sentence Embeddings☆25May 27, 2021Updated 4 years ago
- Custom ML tracking experiment and debugging tools.☆15Aug 2, 2022Updated 3 years ago
- ☆33May 15, 2024Updated last year
- We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSM…☆21Jun 19, 2021Updated 4 years ago
- ArXiv daily dump and viewer using GitHub Actions - luvata.github.io/arxive☆14Updated this week
- Use MobileNet SSD and openCV to detect and count car on road☆12Jan 13, 2020Updated 6 years ago
- Corpus tiếng việt☆385Oct 3, 2025Updated 5 months ago
- PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)☆777Jul 23, 2024Updated last year
- albert-vi-as-service: A Fork of bert-as-service to deploy albert_vi☆11Apr 29, 2020Updated 5 years ago
- 🦉 Generate quizzes from video application☆19Mar 2, 2026Updated 3 weeks ago
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)☆149Dec 31, 2024Updated last year
- Scripts to automate simple tasks throughout learning process at UET-VNU☆18Jun 8, 2021Updated 4 years ago
- ☆10Mar 31, 2022Updated 3 years ago
- Visualising Losses in Deep Neural Networks☆16Jul 17, 2024Updated last year
- paraphase sentence☆11Aug 22, 2025Updated 7 months ago
- Solution for Zalo AI Challenge 2022 - E2E Question Answering☆110Dec 25, 2022Updated 3 years ago
- Bedrock and Claude Deep Dive Workshop☆14Dec 14, 2024Updated last year
- ☆24Sep 2, 2022Updated 3 years ago
- Pre-trained Word2Vec models for Vietnamese☆161Dec 30, 2020Updated 5 years ago
- Sentence Embeddings with BERT & XLNet☆27Aug 23, 2020Updated 5 years ago
- ☆30Feb 6, 2023Updated 3 years ago
- ☆17Mar 12, 2021Updated 5 years ago
- ☆17Jul 10, 2022Updated 3 years ago
- 关于behance爬虫项目☆10May 16, 2019Updated 6 years ago
- Learn how to combine Nginx + wigs + load balancing + flask + unit testing + Docker☆12Jun 2, 2021Updated 4 years ago
- Llama Sensei: An AI-Powered Learning Assistant☆22Aug 28, 2024Updated last year
- ☆47Sep 3, 2025Updated 6 months ago
- ☆26Jan 28, 2024Updated 2 years ago
- This repository contains the solution for determining available spots in parking lots from captured images☆14Sep 29, 2019Updated 6 years ago
- a solid and strong baseline of pedestrian attribute recognition☆40May 23, 2020Updated 5 years ago
- This repo implements and trains DallE-1 on a synthetically generated dataset which has colored mnist images on texture/solid background a…☆13Oct 30, 2024Updated last year
- A Large-scale Vietnamese News Text Classification Corpus☆108Sep 24, 2019Updated 6 years ago
- Ghi chép ban đầu về telegram bot☆11Jun 11, 2018Updated 7 years ago