Vietnamese text data crawler scripts for various sites (including Youtube, Facebook, 4rum, news, ...)
☆76Oct 25, 2022Updated 3 years ago
Alternatives and similar repositories for social-scraper
Users that are interested in social-scraper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆29Apr 7, 2023Updated 3 years ago
- Dùng scrapy-splash kết hợp lua script để crawl các trang web sử dụng Javascript (websosanh)☆16Dec 8, 2022Updated 3 years ago
- Sentiment classification for Vietnamese text using PhoBert☆99Nov 16, 2020Updated 5 years ago
- ntc-scv is dataset of blogs on website https://streetcodevn.com☆27Oct 21, 2021Updated 4 years ago
- Pre-trained Word2Vec models for Vietnamese☆161Dec 30, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Large-scale Vietnamese News Text Classification Corpus☆109Sep 24, 2019Updated 6 years ago
- Cải thiện Elasticsearch trong bài toán semantic search sử dụng phương pháp Sentence Embeddings☆25May 27, 2021Updated 5 years ago
- ☆10Dec 10, 2018Updated 7 years ago
- Vietnamese self-supervised Wav2vec2 model☆61Nov 5, 2022Updated 3 years ago
- Công cụ quét và phân tích từ khoá các trang báo mạng Việt Nam☆266May 22, 2023Updated 3 years ago
- To collect and promote FOSS projects started by and contributed to by Vietnamese☆12Sep 24, 2018Updated 7 years ago
- Framework quét dữ liệu trên Internet hỗ trợ render javascript và quét đa nhiệm☆48Jul 6, 2022Updated 3 years ago
- ☆15Jun 12, 2023Updated 3 years ago
- Tutorial phân loại văn bản sử dụng một số thuật toán học máy☆10Aug 8, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Nơi mà tôi sẽ up wu các report lỗ hổng của các trang web☆26Dec 7, 2023Updated 2 years ago
- ☆35Aug 1, 2024Updated last year
- Custom ML tracking experiment and debugging tools.☆15Aug 2, 2022Updated 3 years ago
- Thư viện chuẩn hóa văn bản Tiếng Việt☆181May 26, 2025Updated last year
- Machine Learning Project Template - Ready to production☆101Dec 13, 2022Updated 3 years ago
- Một cuốn sách tập trung vào hướng dẫn cách cấu trúc các dự án Học Máy và phân tích cách làm cho các thuật toán Học Máy hoạt động.☆1,089Oct 13, 2021Updated 4 years ago
- Vietnamese Punctuation Prediction using Pretrained Language Models☆14May 8, 2022Updated 4 years ago
- Vietnamese Human-based Text-to-Speech☆13Sep 9, 2012Updated 13 years ago
- PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)☆149Dec 31, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t…☆375Sep 5, 2022Updated 3 years ago
- Source code for Zalo AI 2021 submission☆141Dec 20, 2021Updated 4 years ago
- ☆61Oct 19, 2021Updated 4 years ago
- ☆12Oct 6, 2024Updated last year
- ☆47Dec 13, 2019Updated 6 years ago
- Corpus tiếng việt☆384Oct 3, 2025Updated 8 months ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆113Jun 10, 2023Updated 3 years ago
- dentifying gender and regional accent from speech☆37Aug 21, 2018Updated 7 years ago
- ☆16Jun 17, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Thư viện hổ trợ chuyển đổi số sang chữ số Tiếng Việt.☆21Oct 16, 2021Updated 4 years ago
- ☆21May 9, 2026Updated last month
- Machine Reading Comprehension special for the Vietnamese language☆41Mar 13, 2022Updated 4 years ago
- PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)☆788Jul 23, 2024Updated last year
- Lineage 2 Server Emulator in .NET☆27Nov 29, 2025Updated 6 months ago
- ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing (EMNLP-2020 Findings)☆37Jul 22, 2024Updated last year
- vietnamese OCR☆141Apr 28, 2019Updated 7 years ago