howl-anderson/Chinese_tokenizer_benchmark

howl-anderson / Chinese_tokenizer_benchmark

Chinese word segmentation software benchmark | Chinese tokenizer benchmark

☆26

Alternatives and similar repositories for Chinese_tokenizer_benchmark

Users that are interested in Chinese_tokenizer_benchmark are comparing it to the libraries listed below.

howl-anderson / MicroTokenizer
A lightweight, full-featured Chinese tokenizer that helps students understand how tokenizers work. MicroTokenizer: A lightweight Chinese tokenizer designed for educational and research purposes. Provides a…
☆159 · Oct 18, 2024 · Updated last year
hangyav / UnsupPSE
Unsupervised parallel sentence extraction from comparable corpora
☆16 · Aug 6, 2019 · Updated 6 years ago
ephialtes-t / shenbao-metadata
☆12 · Aug 24, 2022 · Updated 3 years ago
cisnlp / parcoure
ParCourE - Parallel Corpus Explorer
☆12 · Dec 27, 2021 · Updated 4 years ago
NielsRogge / tapas_utils
A package containing utils for the PyTorch version of the Tapas algorithm.
☆11 · Apr 29, 2021 · Updated 5 years ago
libeineu / SDT-Training
The implementation of "Shallow-to-Deep Training for Neural Machine Translation"
☆10 · Oct 26, 2020 · Updated 5 years ago
ANXS / build-essential
Ansible role for packages required for compiling C software from source.
☆15 · May 21, 2026 · Updated 2 months ago
ziyin-dl / ngram-word2vec
☆18 · May 11, 2021 · Updated 5 years ago
Anterotesis / historical-texts
Collections of English historical texts and data relating to them
☆19 · Mar 24, 2021 · Updated 5 years ago
lsqdecodebox / Spelling_Error_Correction
A reproduction of Soft-Masked BERT, from the paper "Spelling Error Correction with Soft-Masked BERT"
☆12 · Oct 14, 2020 · Updated 5 years ago
howl-anderson / corpus_dataset_for_Chinese_NLP
Corpus datasets for Chinese NLP
☆20 · Dec 14, 2018 · Updated 7 years ago
tiandiweizun / chinese-segmentation-evaluation
Evaluation of Chinese word segmentation tools
☆63 · Jan 31, 2023 · Updated 3 years ago
songmzhang / CBMI
The code of ACL2022 paper "Conditional Bilingual Mutual Information based Adaptive Training for Neural Machine Translation".
☆14 · Aug 6, 2022 · Updated 3 years ago
piantado / ngrampy
Tools in Python for dealing with Google Books Ngram files and other similar data sets.
☆19 · May 7, 2014 · Updated 12 years ago
ekochmar / cl-datasci-pnp
Data Science: Principles and Practice, 2019-20
☆11 · Nov 25, 2019 · Updated 6 years ago
sean-bennett112 / mesos-docker
☆10 · Mar 28, 2015 · Updated 11 years ago
mmberg / nadia
Natural Dialogue System
☆22 · Mar 23, 2016 · Updated 10 years ago
shuo-git / InfECE
☆20 · Dec 31, 2020 · Updated 5 years ago
SVAIGBA / WMSeg
☆174 · Dec 6, 2022 · Updated 3 years ago
jtonglet / Numerical-Hybrid-QA-Literature
A list of Numerical Multimodal reasoning papers and their implementation
☆11 · May 13, 2024 · Updated 2 years ago
YaoXinZhi / Bi-LSTM-Attention
First-year graduate deep learning course assignment: Bi-LSTM + Attention (nice code)
☆11 · Jan 13, 2020 · Updated 6 years ago
vlisivka / docker-centos7-systemd-unpriv
Dockerfile for CentOS7 with Systemd in unprivileged mode
☆12 · Mar 25, 2016 · Updated 10 years ago
cebel / pyctd
PyCTD is a Python software package to query and analyse data from the CTD database
☆12 · Oct 6, 2022 · Updated 3 years ago
gexijin / learnR
Learn R through examples
☆28 · Jan 17, 2024 · Updated 2 years ago
shooshx / zippypy
A simple, lightweight Python 2.7 interpreter, with predictable memory management and without global locks.
☆20 · May 6, 2023 · Updated 3 years ago
DFKI-NLP / REval
[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction
☆13 · Apr 21, 2020 · Updated 6 years ago
ashengtx / CilinSimilarity
Word similarity computation based on Tongyici Cilin
☆122 · Jun 27, 2017 · Updated 9 years ago
ANXS / apt
Ansible role for apt
☆14 · May 21, 2026 · Updated 2 months ago
cheenwe / qingting_api
Qingting FM (蜻蜓FM) API
☆13 · Jan 8, 2017 · Updated 9 years ago
Nellaker-group / placenta
☆21 · Nov 14, 2022 · Updated 3 years ago
jannson / wordmaker
Auto-generate Chinese words from huge text.
☆92 · Nov 25, 2014 · Updated 11 years ago
trangptm / Column_networks
Column Networks for Collective Classification: A novel deep learning model for collective classification in multi-relational domains
☆12 · Nov 22, 2016 · Updated 9 years ago
BorgwardtLab / MvKDR
Multi-view Spectral Clustering on Conflicting Views
☆15 · Jul 12, 2017 · Updated 9 years ago
SympCheck / NeuralSymptomChecker
☆19 · Jul 29, 2022 · Updated 3 years ago
Satyam-Bhalla / Cheatsheets
This repo contains all the cheatsheets that I found important.
☆10 · Oct 27, 2020 · Updated 5 years ago
oliverwy / pytorch04ex
☆11 · Feb 13, 2020 · Updated 6 years ago
THUNLP-MT / Template-NMT
☆23 · Nov 15, 2022 · Updated 3 years ago
math-eval / aaai2024comp
AAAI2024 Global Competition on Math Problem Solving and Reasoning
☆14 · Oct 4, 2023 · Updated 2 years ago
xdite / kaike
☆10 · Sep 30, 2019 · Updated 6 years ago