Building a 500GB dataset (20% done) of Vietnamese text for training large language models
☆29Apr 7, 2023Updated 3 years ago
Alternatives and similar repositories for vi
Users that are interested in vi are comparing it to the libraries listed below.
- ☆18Dec 20, 2023Updated 2 years ago
- Transforming spoken text into written text☆31May 14, 2024Updated last year
- Anyone can build their own chatbot with instruction tuning, using a 12GB GPU (RTX 3060) and a few dozen MB of data☆114Jun 10, 2023Updated 2 years ago
- Machine Reading Comprehension specifically for the Vietnamese language☆41Mar 13, 2022Updated 4 years ago
- Vietnamese text data crawler scripts for various sites (including Youtube, Facebook, 4rum, news, ...)☆75Oct 25, 2022Updated 3 years ago
- Improving Elasticsearch for semantic search using Sentence Embeddings☆25May 27, 2021Updated 4 years ago
- Custom ML tracking experiment and debugging tools.☆15Aug 2, 2022Updated 3 years ago
- ☆33May 15, 2024Updated last year
- We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSM…☆24Jun 19, 2021Updated 4 years ago
- A recent experiment with the MLP-Mixer model on the sentiment recognition task (sentiment analysis)☆13Jul 9, 2021Updated 4 years ago
- A dataset for Vietnamese Spelling Correction☆15Sep 27, 2021Updated 4 years ago
- Use MobileNet SSD and OpenCV to detect and count cars on the road☆11Jan 13, 2020Updated 6 years ago
- Vietnamese corpus☆383Oct 3, 2025Updated 7 months ago
- PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)☆780Jul 23, 2024Updated last year
- albert-vi-as-service: A Fork of bert-as-service to deploy albert_vi☆11Apr 29, 2020Updated 6 years ago
- ☆50Oct 15, 2025Updated 6 months ago
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- 🦉 Generate quizzes from video application☆19Apr 6, 2026Updated 3 weeks ago
- Machine Reading Comprehension has attracted significant interest in research on natural language understanding, and large-scale datasets …☆10Aug 14, 2021Updated 4 years ago
- BERTserini☆27Oct 13, 2022Updated 3 years ago
- Scripts to automate simple tasks throughout learning process at UET-VNU☆17Jun 8, 2021Updated 4 years ago
- ☆21Jun 13, 2019Updated 6 years ago
- ☆10Mar 31, 2022Updated 4 years ago
- Paraphrase sentence☆11Aug 22, 2025Updated 8 months ago
- Solution for Zalo AI Challenge 2022 - E2E Question Answering☆110Dec 25, 2022Updated 3 years ago
- A chrome extension to toggle subtitles using keyboard shortcut (C)☆10Jul 4, 2025Updated 10 months ago
- Bedrock and Claude Deep Dive Workshop☆14Dec 14, 2024Updated last year
- Pre-trained Word2Vec models for Vietnamese☆160Dec 30, 2020Updated 5 years ago
- Sentence Embeddings with BERT & XLNet☆27Aug 23, 2020Updated 5 years ago
- ☆30Feb 6, 2023Updated 3 years ago
- ☆17Jul 10, 2022Updated 3 years ago
- About the Behance crawler project☆10May 16, 2019Updated 6 years ago
- Learn how to combine Nginx + wigs + load balancing + flask + unit testing + Docker☆11Jun 2, 2021Updated 4 years ago
- Vietnamese sensitive words (including teencode), created by an ML algorithm☆68Jan 13, 2021Updated 5 years ago
- Llama Sensei: An AI-Powered Learning Assistant☆23Aug 28, 2024Updated last year
- ☆26Jan 28, 2024Updated 2 years ago
- A Large-scale Vietnamese News Text Classification Corpus☆107Sep 24, 2019Updated 6 years ago
- This repo implements and trains DallE-1 on a synthetically generated dataset which has colored mnist images on texture/solid background a…☆13Oct 30, 2024Updated last year
- Initial notes on Telegram bots☆11Jun 11, 2018Updated 7 years ago