This is an introduction to Chinese words segmentation using Jieba.
☆14May 31, 2018Updated 7 years ago
Alternatives and similar repositories for Chinese-NLP-Jieba
Users that are interested in Chinese-NLP-Jieba are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A python wechat official account scrawler, using sougou search engine.☆13Aug 11, 2017Updated 8 years ago
- Text Alignment with Smith-Waterman☆11Nov 26, 2025Updated 4 months ago
- Scrapes headlines from CNN and FOX, then has ChatGPT do cross-analysis☆11Apr 19, 2023Updated 2 years ago
- ☆11Dec 30, 2015Updated 10 years ago
- With the rapid adoption of smartphones, tablets, and mobile apps, they are increasingly becoming part of children’s daily life for amusem…☆12Apr 7, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Set of functions to convert between ISCO codes and to compute class classification schemes (EGP, ESeC, etc.)☆15May 4, 2022Updated 3 years ago
- Cornell INFO 3350: Text mining for history and literature, Fall 2020☆10Jan 14, 2021Updated 5 years ago
- A comprehensive list of Natural Language Processing resources for the Georgian language☆14Feb 19, 2026Updated last month
- A massively multilingual corpus and pretrained model for IGT☆13Mar 13, 2026Updated 2 weeks ago
- ☆13Jan 8, 2021Updated 5 years ago
- This code implements a basic, Twitter-aware tokenizer.☆12Feb 8, 2024Updated 2 years ago
- Senior A.I. project to generate realistic news articles like those found on CNN, NYTimes, Fox News, etc. Future research will involve con…☆15Apr 26, 2019Updated 6 years ago
- ☆14Mar 2, 2018Updated 8 years ago
- Coptic NLP pipeline page and utilities☆16Feb 11, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A collection of tricky code☆24Apr 30, 2017Updated 8 years ago
- 深度学习神经网络构建源码。☆10Aug 8, 2019Updated 6 years ago
- An OCR application focused on machine-print Tibetan text.☆18Jun 29, 2018Updated 7 years ago
- 甜橙金融杯Rank14☆17Dec 18, 2018Updated 7 years ago
- Source code of the experiment documented in: Pitsilis G.K, Ramampiaro, H., Langseth, H., “Detecting Offensive Language in Tweets Using De…☆20May 21, 2021Updated 4 years ago
- TensorFlow code and pre-trained models for BERT☆18Dec 18, 2019Updated 6 years ago
- ☆96Nov 12, 2025Updated 4 months ago
- Python code implementing the piecewise segmentation of a signal given in input. Three main algorithms (sliding windows, top down and bott…☆10Mar 15, 2016Updated 10 years ago
- Classification of incivility in Reddit posts☆18Nov 19, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Simple Python wrapper for querying data with TikTok's research API☆13Dec 25, 2023Updated 2 years ago
- TikTok-Teller: A TikTok Video Scraping and Content Analysis Tool☆20Nov 20, 2023Updated 2 years ago
- The Syriac Old Testament, also known as Peshitta, in Text-Fabric☆28Jan 23, 2024Updated 2 years ago
- Script that fetches comments from a TikTok post☆16Apr 27, 2023Updated 2 years ago
- 2019 CAIL 法研杯机器阅读理解挑战赛 第8名 解决方案☆16Jul 13, 2019Updated 6 years ago
- Incivility classifier used in Theocharis et al (2020, Sage Open)☆20Aug 29, 2022Updated 3 years ago
- Deep learning model for animal sound classification.☆35May 4, 2024Updated last year
- "fork" of PyMonad on BitBucket to change the ``*`` functor/composition operator to ``<<``☆31Jul 22, 2014Updated 11 years ago
- Creating a model for the recognition and classification of ancient Egyptian Hieroglyphs. Using transfer learning on convolutional neural …☆24Nov 30, 2025Updated 4 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- CHIP2018评测任务2,平安医疗科技智能患者健康咨询问句匹配大赛baseline,BiLSTM+特征工程计算相似性,10折交叉验证平均投票做bagging,F1值0.83左右,rank16。☆19Dec 4, 2018Updated 7 years ago
- A collection of computer tools for aiding the text critical workflow from transcription to collation to analysis.☆26Apr 6, 2025Updated 11 months ago
- A python package for classifying emotion☆18Oct 20, 2020Updated 5 years ago
- annotated hateful speech☆24Apr 6, 2019Updated 6 years ago
- Rails 3 task to dump (parts) of your database to db/seeds.rb☆48Sep 4, 2012Updated 13 years ago
- ☆58Mar 20, 2023Updated 3 years ago
- The dictionary comprised of the Coptic lexicon created by the BBAW and interface by Coptic SCRIPTORIUM. Currently deployed at https://co…☆32Jan 9, 2025Updated last year