behitek / social-scraper
Vietnamese text data crawler scripts for various sites (including Youtube, Facebook, 4rum, news, ...)
☆74Updated 2 years ago
Alternatives and similar repositories for social-scraper:
Users that are interested in social-scraper are comparing it to the libraries listed below
- A Large-scale Vietnamese News Text Classification Corpus☆101Updated 5 years ago
- Công cụ quét và phân tích từ khoá các trang báo mạng Việt Nam☆267Updated last year
- Vietnamese Chatbot☆92Updated 10 months ago
- Từ điển Họ Tên trong Việt Nam☆93Updated last year
- Sentiment classification for Vietnamese text using PhoBert☆99Updated 4 years ago
- Thư viện chuẩn hóa văn bản Tiếng Việt☆177Updated last year
- Framework quét dữ liệu trên Internet hỗ trợ render javascript và quét đa nhiệm☆47Updated 2 years ago
- Pre-trained Word2Vec models for Vietnamese☆154Updated 4 years ago
- Vietnamese question answering system with BERT☆117Updated 2 years ago
- code and dataset☆150Updated 5 years ago
- Vietnamese stopwords☆180Updated 2 years ago
- Project to share nlp algorithms☆65Updated 6 years ago
- Thư viện xữ lý chữ số dành riêng cho Tiếng Việt.☆75Updated last week
- 1st place solution for Zalo AI 2019 - Vietnamese Wiki Question Answering☆49Updated 5 years ago
- Mô hình ngôn ngữ lớn cho người Việt☆60Updated last year
- MTet: Multi-domain Translation for English and Vietnamese☆180Updated 2 years ago
- Zalo AI chalenge Voice Gender classification (https://challenge.zalo.ai/)☆129Updated 6 years ago
- Corpus tiếng việt☆356Updated 8 months ago
- Solution for MC_OCR competition☆93Updated last year
- Vietnamese language model for spacy.io☆107Updated last year
- A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)☆127Updated 6 months ago
- Pre-trained Word2Vec syllable- and word-level embeddings for Vietnamese☆51Updated last year
- Python Vietnamese Core NLP Toolkit☆253Updated 4 months ago
- Vietnamese speech recognition using Wavenet☆72Updated 2 years ago
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t…☆353Updated 2 years ago
- ☆111Updated last year
- Vietnamese sensitive words (including teencode) was created by ML algorithm☆65Updated 4 years ago
- Bản dịch của cuốn "Interpretable Machine Learning: A Guide for Making Black Box Models Explainable" sang tiếng Việt☆116Updated 3 years ago
- DANeS is an open-source E-newspaper dataset by collaboration between DATASET JSC (dataset.vn) and AIV Group (aivgroup.vn)☆66Updated 2 years ago
- Source code for Zalo AI 2021 submission☆139Updated 3 years ago