This is an introduction to Chinese word segmentation using Jieba.
☆14 · May 31, 2018 · Updated 7 years ago
Alternatives and similar repositories for Chinese-NLP-Jieba
Users that are interested in Chinese-NLP-Jieba are comparing it to the libraries listed below
- Cornell INFO 3350: Text mining for history and literature, Fall 2020 ☆10 · Jan 14, 2021 · Updated 5 years ago
- With the rapid adoption of smartphones, tablets, and mobile apps, they are increasingly becoming part of children’s daily life for amusem… ☆12 · Apr 7, 2017 · Updated 8 years ago
- Generate an appropriate question from a passage. ☆10 · May 18, 2021 · Updated 4 years ago
- Rails 3 task to dump (parts) of your database to db/seeds.rb ☆48 · Sep 4, 2012 · Updated 13 years ago
- Scrapes headlines from CNN and FOX, then has ChatGPT do cross-analysis ☆11 · Apr 19, 2023 · Updated 2 years ago
- PyCSVSchema is an implementation of CSV Schema in Python. ☆12 · May 18, 2020 · Updated 5 years ago
- ☆11 · Dec 30, 2015 · Updated 10 years ago
- ☆13 · Jan 8, 2021 · Updated 5 years ago
- A massively multilingual corpus and pretrained model for IGT ☆14 · Feb 21, 2026 · Updated 2 weeks ago
- Text Alignment with Smith-Waterman ☆11 · Nov 26, 2025 · Updated 3 months ago
- A comprehensive list of Natural Language Processing resources for the Georgian language ☆14 · Feb 19, 2026 · Updated 2 weeks ago
- Source code for building deep learning neural networks. ☆10 · Aug 8, 2019 · Updated 6 years ago
- Language-agnostic Code Statistics ☆24 · Mar 3, 2012 · Updated 14 years ago
- Senior A.I. project to generate realistic news articles like those found on CNN, NYTimes, Fox News, etc. Future research will involve con… ☆15 · Apr 26, 2019 · Updated 6 years ago
- This code implements a basic, Twitter-aware tokenizer. ☆12 · Feb 8, 2024 · Updated 2 years ago
- ☆14 · Mar 2, 2018 · Updated 8 years ago
- Set up an automated data science environment using Docker ☆14 · Oct 2, 2018 · Updated 7 years ago
- A Python crawler for WeChat official accounts, using the Sogou search engine. ☆13 · Aug 11, 2017 · Updated 8 years ago
- Simple Python wrapper for querying data with TikTok's research API ☆13 · Dec 25, 2023 · Updated 2 years ago
- Python code implementing the piecewise segmentation of a signal given in input. Three main algorithms (sliding windows, top down and bott… ☆10 · Mar 15, 2016 · Updated 9 years ago
- Set of functions to convert between ISCO codes and to compute class classification schemes (EGP, ESeC, etc.) ☆15 · May 4, 2022 · Updated 3 years ago
- TikTok-Teller: A TikTok Video Scraping and Content Analysis Tool ☆19 · Nov 20, 2023 · Updated 2 years ago
- Coptic NLP pipeline page and utilities