Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn
☆29Apr 7, 2023Updated 3 years ago
Alternatives and similar repositories for vi
Users that are interested in vi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Dec 20, 2023Updated 2 years ago
- Transformation spoken text to written text☆31May 14, 2024Updated last year
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆114Jun 10, 2023Updated 2 years ago
- Machine Reading Comprehension special for the Vietnamese language☆41Mar 13, 2022Updated 4 years ago
- Vietnamese text data crawler scripts for various sites (including Youtube, Facebook, 4rum, news, ...)☆75Oct 25, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Dự án bao gồm: 1. Xây dựng bộ dữ Instructions Vietnamese (chất lượng, nhiều, và đa dạng). 2.LLM Training, Finetuning, Evaluating & Testin…☆279Sep 1, 2025Updated 7 months ago
- ☆33May 15, 2024Updated last year
- We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSM…☆21Jun 19, 2021Updated 4 years ago
- ☆25Aug 28, 2024Updated last year
- A dataset for Vietnamese Spelling Correction☆15Sep 27, 2021Updated 4 years ago
- ArXiv daily dump and viewer using GitHub Actions - luvata.github.io/arxive☆14Apr 6, 2026Updated last week
- Use MobileNet SSD and openCV to detect and count car on road☆12Jan 13, 2020Updated 6 years ago
- Corpus tiếng việt☆383Oct 3, 2025Updated 6 months ago
- ☆16Dec 13, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- albert-vi-as-service: A Fork of bert-as-service to deploy albert_vi☆11Apr 29, 2020Updated 5 years ago
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- Sherpa-onnx-tts-stt source for homeassisstant addon with Kroko Onnx Streaming STT integration.☆28Dec 18, 2025Updated 3 months ago
- Machine Reading Comprehension has attracted significant interest in research on natural language understanding, and large-scale datasets …☆10Aug 14, 2021Updated 4 years ago
- Scripts to automate simple tasks throughout learning process at UET-VNU☆18Jun 8, 2021Updated 4 years ago
- Provides a web based "Minecraft Server as a Service" (MCaaS?) to deploy Minecraft server containers on any Docker Swarm cluster or standa…☆12Jun 16, 2017Updated 8 years ago
- ☆21Jun 13, 2019Updated 6 years ago
- We finetune Bloomz-7b1-mt using LoRA with the chatdoctor-200k dataset at here https://huggingface.co/LinhDuong/doctorwithbloomz-7b1-mt an…☆30Apr 4, 2023Updated 3 years ago
- Visualising Losses in Deep Neural Networks☆16Jul 17, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆10Mar 31, 2022Updated 4 years ago
- paraphase sentence☆11Aug 22, 2025Updated 7 months ago
- Solution for Zalo AI Challenge 2022 - E2E Question Answering☆110Dec 25, 2022Updated 3 years ago
- Bedrock and Claude Deep Dive Workshop☆14Dec 14, 2024Updated last year
- ☆24Sep 2, 2022Updated 3 years ago
- Pre-trained Word2Vec models for Vietnamese☆161Dec 30, 2020Updated 5 years ago
- Sentence Embeddings with BERT & XLNet☆27Aug 23, 2020Updated 5 years ago
- ☆21Sep 3, 2024Updated last year
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆42Feb 7, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆30Feb 6, 2023Updated 3 years ago
- ☆17Mar 12, 2021Updated 5 years ago
- A tutorial to set up a running compute cluster on cloud resources☆11Jul 7, 2023Updated 2 years ago
- 关于behance爬虫项目☆10May 16, 2019Updated 6 years ago
- Vietnamese sensitive words (including teencode) was created by ML algorithm☆67Jan 13, 2021Updated 5 years ago
- Llama Sensei: An AI-Powered Learning Assistant☆22Aug 28, 2024Updated last year
- ☆26Jan 28, 2024Updated 2 years ago