Vietnamese text data crawler scripts for various sites (including YouTube, Facebook, forums, news, ...)
☆75 · Oct 25, 2022 · Updated 3 years ago
Alternatives and similar repositories for social-scraper
Users who are interested in social-scraper are comparing it to the repositories listed below.
- Building a 500GB dataset (20% done) of Vietnamese text for training large language models ☆29 · Apr 7, 2023 · Updated 2 years ago
- Using scrapy-splash combined with Lua scripts to crawl websites that use JavaScript (websosanh) ☆16 · Dec 8, 2022 · Updated 3 years ago
- Sentiment classification for Vietnamese text using PhoBert ☆98 · Nov 16, 2020 · Updated 5 years ago
- ntc-scv is a dataset of blogs from the website https://streetcodevn.com ☆26 · Oct 21, 2021 · Updated 4 years ago
- Pre-trained Word2Vec models for Vietnamese ☆161 · Dec 30, 2020 · Updated 5 years ago
- Vietnamese sensitive words (including teencode), created by an ML algorithm ☆67 · Jan 13, 2021 · Updated 5 years ago
- A Large-scale Vietnamese News Text Classification Corpus ☆108 · Sep 24, 2019 · Updated 6 years ago
- Improving Elasticsearch for semantic search using sentence embeddings ☆25 · May 27, 2021 · Updated 4 years ago
- ☆10 · Dec 10, 2018 · Updated 7 years ago
- Vietnamese self-supervised Wav2vec2 model ☆61 · Nov 5, 2022 · Updated 3 years ago
- A tool for crawling and analyzing keywords from Vietnamese online newspapers ☆265 · May 22, 2023 · Updated 2 years ago
- A framework for crawling data on the Internet, with support for JavaScript rendering and multitasking crawls ☆48 · Jul 6, 2022 · Updated 3 years ago
- This project demonstrates a production-grade MLOps pipeline that deploys a YOLOv11-based face detection service on Google Kubernetes Engi… ☆38 · Jun 9, 2025 · Updated 9 months ago
- Where I will upload write-ups of vulnerability reports for websites ☆20 · Dec 7, 2023 · Updated 2 years ago
- Tutorial on text classification using several machine learning algorithms ☆10 · Aug 8, 2020 · Updated 5 years ago
- ☆35 · Aug 1, 2024 · Updated last year
- Custom ML tracking experiment and debugging tools. ☆15 · Aug 2, 2022 · Updated 3 years ago
- Vietnamese text normalization library ☆181 · May 26, 2025 · Updated 10 months ago
- AirPlay server with AirTunes support ☆21 · Apr 14, 2021 · Updated 4 years ago
- Sing any popular song with your voice ☆11 · Jul 10, 2022 · Updated 3 years ago
- Machine Learning Project Template - Ready to production ☆101 · Dec 13, 2022 · Updated 3 years ago
- Large Language Models (LLMs) Learning Resources ☆19 · Jun 16, 2024 · Updated last year
- A book focused on how to structure machine learning projects and on analyzing how to make machine learning algorithms work. ☆1,084 · Oct 13, 2021 · Updated 4 years ago
- Vietnamese Punctuation Prediction using Pretrained Language Models ☆14 · May 8, 2022 · Updated 3 years ago
- Vietnamese Human-based Text-to-Speech ☆13 · Sep 9, 2012 · Updated 13 years ago
- PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021) ☆149 · Dec 31, 2024 · Updated last year
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t… ☆371 · Sep 5, 2022 · Updated 3 years ago
- Sentence Embeddings with BERT & XLNet ☆27 · Aug 23, 2020 · Updated 5 years ago
- ☆11 · Feb 2, 2024 · Updated 2 years ago
- Source code for Zalo AI 2021 submission ☆142 · Dec 20, 2021 · Updated 4 years ago
- ☆63 · Oct 19, 2021 · Updated 4 years ago
- ☆12 · Oct 6, 2024 · Updated last year
- ☆48 · Dec 13, 2019 · Updated 6 years ago
- Vietnamese corpus ☆385 · Oct 3, 2025 · Updated 5 months ago
- Anyone can build their own chatbot with instruction tuning, using a 12G GPU (RTX 3060) and a few dozen MB of data ☆114 · Jun 10, 2023 · Updated 2 years ago
- Fine-tunes multiple pre-trained Transformer-based models to solve the Vietnamese Fake News Detection problem (ReINTEL) in the VLSP2020 shared task ☆18 · Dec 16, 2020 · Updated 5 years ago
- Includes Vietnamese stop words, Vietnamese person names, Vietnam GIS (Geographic Information System) data, a Vietnamese dictionary, ... ☆15 · Oct 18, 2017 · Updated 8 years ago
- Zalo AI Challenge 2020: News Summarization - Runner-up solution ☆20 · Dec 4, 2020 · Updated 5 years ago
- VietConizer: Vietnamese OCR with NVIDIA DALI ☆16 · Jul 5, 2025 · Updated 8 months ago