trannguyenhan / preprocessing-data
Preprocessing Vietnamese data in 4 steps
☆14Updated 3 years ago
Alternatives and similar repositories for preprocessing-data
Users that are interested in preprocessing-data are comparing it to the libraries listed below
- Streams Tiki data with Kafka, processes it with Spark, and visualizes it with Elasticsearch & Kibana.☆11Updated 3 years ago
- A space for practicing and honing algorithms.☆90Updated 3 years ago
- Source code for Zalo AI 2021 submission☆140Updated 3 years ago
- Machine Learning basics for everyone☆80Updated 4 years ago
- Website parsing and content extraction with jsoup☆9Updated 3 years ago
- Machine translation between English and Vietnamese☆51Updated 4 years ago
- System analysis and design☆49Updated last week
- Coursework for the Web Technology and Online Services course☆11Updated 3 years ago
- Pre-trained Word2Vec models for Vietnamese☆156Updated 4 years ago
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)☆103Updated 9 months ago
- A Large-scale Vietnamese News Text Classification Corpus☆104Updated 5 years ago
- Bring your timetable to Google Calendar☆74Updated 8 months ago
- Applies the PhoBERT model by VinAI Research to the Vietnamese NER task on various datasets☆20Updated 2 years ago
- Vietnamese stopwords☆184Updated 2 years ago
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t…☆359Updated 2 years ago
- Pre-trained Word2Vec syllable- and word-level embeddings for Vietnamese☆52Updated last year
- Sentiment classification for Vietnamese text using PhoBERT☆98Updated 4 years ago
- code and dataset☆150Updated 5 years ago
- Dictionary of family and given names in Vietnam☆95Updated last year
- A Python wrapper for VnCoreNLP using a bidirectional communication channel.☆56Updated 6 years ago
- Vietnamese corpus☆358Updated 11 months ago
- ☆112Updated 2 years ago
- Leverage Deep Learning to digitize old Vietnamese handwritten for historical document archiving (Made with national pride in every single…☆128Updated 11 months ago
- Source code in ebook Machine Learning☆173Updated 6 years ago
- A website about Bigdata & Technology from Demanejar team (Vietnamese Language Blogs) https://demanejar.github.io.☆12Updated 2 weeks ago
- Framework for crawling data on the Internet, with support for JavaScript rendering and multitasking crawling☆47Updated 2 years ago
- Tool for crawling and analyzing keywords from Vietnamese online newspapers☆269Updated last year
- DANeS is an open-source E-newspaper dataset by collaboration between DATASET JSC (dataset.vn) and AIV Group (aivgroup.vn)☆67Updated 3 years ago
- Vietnamese sensitive words (including teencode), created by an ML algorithm☆65Updated 4 years ago
- Deep learning Book Vietnamese Translation☆34Updated 6 years ago