telexyz / vi
Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn
☆25Updated last year
Related projects ⓘ
Alternatives and complementary repositories for vi
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆111Updated last year
- Vietnamese self-supervised Wav2vec2 model☆60Updated 2 years ago
- ☆12Updated 2 years ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆19Updated 4 months ago
- Solution for Zalo AI Challenge 2022 - E2E Question Answering☆109Updated last year
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)☆99Updated 3 months ago
- Pre-training script for BART in JAX/Flax☆37Updated 2 years ago
- ☆59Updated last year
- ☆59Updated 6 months ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆24Updated 6 months ago
- Machine Reading Comprehension special for the Vietnamese language☆38Updated 2 years ago
- Pioneering in Vietnamese Multimodal Large Language Model☆40Updated 3 months ago
- ☆25Updated 9 months ago
- Sentiment classification for Vietnamese text using PhoBert☆96Updated 4 years ago
- ☆46Updated last year
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆25Updated last year
- Dịch máy giữa ngôn ngữ anh-viet☆49Updated 4 years ago
- RAG for Vietnamese Wikipedia corpus.☆23Updated 11 months ago
- A Robustly Optimized BERT Pretraining Approach for Vietnamese☆31Updated 3 months ago
- Top 1 Quy Nhon AI Hackathon 2022 Challenge Smart Menu☆31Updated 2 years ago
- ☆17Updated 2 years ago
- Pre-trained Word2Vec syllable- and word-level embeddings for Vietnamese☆49Updated last year
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆54Updated last year
- Bud500: A Comprehensive Vietnamese ASR Dataset☆64Updated 8 months ago
- ☆25Updated 2 months ago
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆67Updated 9 months ago
- top 1 Zalo AI challenge 2021 task hum to song☆108Updated 2 years ago
- A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)☆123Updated 3 months ago
- ☆46Updated 3 months ago
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)☆39Updated 3 months ago