sonlam1102 / vihsd
A large-scale dataset for Vietnamese hate speech detection
☆20 Updated 3 years ago
Related projects
Alternatives and complementary repositories for vihsd
- Sentiment classification for Vietnamese text using PhoBERT ☆96 Updated 4 years ago
- Pre-trained Word2Vec syllable- and word-level embeddings for Vietnamese ☆49 Updated last year
- ViSD4SA, a Vietnamese span detection dataset for aspect-based sentiment analysis ☆15 Updated 2 years ago
- Solution for Zalo AI Challenge 2022 - E2E Question Answering ☆109 Updated last year
- COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021) ☆65 Updated 3 months ago
- Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL 2023) ☆33 Updated 11 months ago
- Vietnamese question answering system with BERT ☆116 Updated last year
- 1st place solution for Zalo AI 2019 - Vietnamese Wiki Question Answering ☆49 Updated 4 years ago
- Source code for Zalo AI 2021 submission ☆136 Updated 2 years ago
- Top-1 solution for the Zalo AI Challenge 2021 hum-to-song task ☆108 Updated 2 years ago
- A Large-scale Vietnamese News Text Classification Corpus ☆101 Updated 5 years ago
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022) ☆99 Updated 3 months ago
- A collection of Vietnamese Natural Language Processing resources. ☆223 Updated 4 months ago
- PhoBERT model by VinAI Research applied to the Vietnamese NER task on various datasets ☆14 Updated 2 years ago
- Build English-Vietnamese machine translation with ProtonX Transformer. :D ☆64 Updated 3 years ago
- A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022) ☆123 Updated 3 months ago
- My own Docker tutorial for self-learning ☆93 Updated 2 years ago
- Large Language Models (LLMs) Learning Resources ☆17 Updated 5 months ago
- Pre-training script for BART in JAX/Flax ☆37 Updated 2 years ago
- Vietnamese translation of the book "Interpretable Machine Learning: A Guide for Making Black Box Models Explainable" ☆113 Updated 3 years ago
- code and dataset ☆148 Updated 5 years ago
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t… ☆346 Updated 2 years ago
- Vietnamese text normalization library ☆176 Updated last year
- DANeS is an open-source E-newspaper dataset by collaboration between DATASET JSC (dataset.vn) and AIV Group (aivgroup.vn) ☆65 Updated 2 years ago
- Pre-trained Word2Vec models for Vietnamese ☆152 Updated 3 years ago
- A spell corrector and text classifier using Deep Neural Network ☆34 Updated 3 years ago