jsrpy / Chinese-NLP-JiebaView external linksLinks
This is an introduction to Chinese words segmentation using Jieba.
☆14May 31, 2018Updated 7 years ago
Alternatives and similar repositories for Chinese-NLP-Jieba
Users that are interested in Chinese-NLP-Jieba are comparing it to the libraries listed below
Sorting:
- Cornell INFO 3350: Text mining for history and literature, Fall 2020☆10Jan 14, 2021Updated 5 years ago
- With the rapid adoption of smartphones, tablets, and mobile apps, they are increasingly becoming part of children’s daily life for amusem…☆12Apr 7, 2017Updated 8 years ago
- Rails 3 task to dump (parts) of your database to db/seeds.rb☆48Sep 4, 2012Updated 13 years ago
- PyCSVSchema is an implementation of CSV Schema in Python.☆12May 18, 2020Updated 5 years ago
- Generate an appropriate question from a passage.☆10May 18, 2021Updated 4 years ago
- Scrapes headlines from CNN and FOX, then has ChatGPT do cross-analysis☆11Apr 19, 2023Updated 2 years ago
- A massively multilingual corpus and pretrained model for IGT☆12Updated this week
- Text Alignment with Smith-Waterman☆11Nov 26, 2025Updated 2 months ago
- ☆13Jan 8, 2021Updated 5 years ago
- ☆11Dec 30, 2015Updated 10 years ago
- List of resources, tools, and models for NLP in Georgian☆13May 3, 2024Updated last year
- 深度学习神经网络 构建源码。☆10Aug 8, 2019Updated 6 years ago
- This code implements a basic, Twitter-aware tokenizer.☆12Feb 8, 2024Updated 2 years ago
- Senior A.I. project to generate realistic news articles like those found on CNN, NYTimes, Fox News, etc. Future research will involve con…☆15Apr 26, 2019Updated 6 years ago
- Language-agnostic Code Statistics☆24Mar 3, 2012Updated 13 years ago
- A python wechat official account scrawler, using sougou search engine.☆13Aug 11, 2017Updated 8 years ago
- Set up an automated data science environment using Docker☆14Oct 2, 2018Updated 7 years ago
- ☆14Mar 2, 2018Updated 7 years ago
- TikTok-Teller: A TikTok Video Scraping and Content Analysis Tool☆19Nov 20, 2023Updated 2 years ago
- Set of functions to convert between ISCO codes and to compute class classification schemes (EGP, ESeC, etc.)☆15May 4, 2022Updated 3 years ago
- Simple Python wrapper for querying data with TikTok's research API☆13Dec 25, 2023Updated 2 years ago
- Python code implementing the piecewise segmentation of a signal given in input. Three main algorithms (sliding windows, top down and bott…☆10Mar 15, 2016Updated 9 years ago
- Coptic NLP pipeline page and utilities☆16Feb 11, 2025Updated last year
- ☆15Dec 7, 2015Updated 10 years ago
- Source code of the experiment documented in: Pitsilis G.K, Ramampiaro, H., Langseth, H., “Detecting Offensive Language in Tweets Using De…☆20May 21, 2021Updated 4 years ago
- 甜橙金融杯Rank14☆17Dec 18, 2018Updated 7 years ago
- Script that fetches comments from a TikTok post☆16Apr 27, 2023Updated 2 years ago
- L2M2 is a minimal Python library for asynchronous, multi-provider LLM inference.☆29Dec 14, 2025Updated 2 months ago
- Classification of incivility in Reddit posts☆18Nov 19, 2020Updated 5 years ago
- An OCR application focused on machine-print Tibetan text.☆18Jun 29, 2018Updated 7 years ago
- TensorFlow code and pre-trained models for BERT☆18Dec 18, 2019Updated 6 years ago
- A python package for classifying emotion☆18Oct 20, 2020Updated 5 years ago
- Incivility classifier used in Theocharis et al (2020, Sage Open)☆20Aug 29, 2022Updated 3 years ago
- 2019 CAIL 法研杯机器阅读理解挑战赛 第8名 解决方案☆16Jul 13, 2019Updated 6 years ago
- General Utilities☆47Jan 24, 2026Updated 3 weeks ago
- Scala Map Reduce☆51Sep 24, 2008Updated 17 years ago
- Open Letter to University Leaders☆19Apr 6, 2020Updated 5 years ago
- The Syriac Old Testament, also known as Peshitta, in Text-Fabric☆27Jan 23, 2024Updated 2 years ago
- CHIP2018评测任务2,平安医疗科技智能患者健康咨询问句匹配大赛baseline,BiLSTM+特征工程计算相似性,10折交叉验证平均投票做bagging,F1值0.83左右,rank16。☆19Dec 4, 2018Updated 7 years ago