voidism/pywordseg

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/voidism/pywordseg)

voidism / pywordseg

Open Source State-of-the-art Chinese Word Segmentation System with BiLSTM and ELMo. https://arxiv.org/abs/1901.05816

☆46

Alternatives and similar repositories for pywordseg

Users that are interested in pywordseg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

grtzsohalf / buy_vs_rent_and_invest
View on GitHub
☆15Sep 9, 2021Updated 4 years ago
lingjzhu / spoken_sent_embedding
View on GitHub
Unsupervised spoken sentence embeddings
☆14Dec 14, 2022Updated 3 years ago
lantw44 / ceiba-dl
View on GitHub
NTU CEIBA 資料下載工具
☆79Feb 18, 2022Updated 4 years ago
voidful / NLPrep
View on GitHub
🍳 NLPrep - dataset tool for many natural language processing task
☆28Jul 30, 2021Updated 4 years ago
pohanchi / huggingface_albert
View on GitHub
hugginface albert model and its tokenizer
☆15Mar 12, 2020Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
MeteorYee / LSTM-CNN-CWS
View on GitHub
A Deep Learning Tool for Chinese word segmentation (Bi-LSTM, CNN, CRF)
☆35Mar 27, 2019Updated 7 years ago
orbxball / timit-preprocessor
View on GitHub
Extract mfcc vectors and phones from TIMIT dataset
☆17Mar 23, 2023Updated 3 years ago
howard1337 / S2VC
View on GitHub
☆100Jul 22, 2021Updated 5 years ago
ga642381 / Taiwanese-Translation
View on GitHub
Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus
☆13Oct 15, 2022Updated 3 years ago
ffaisal93 / SD-QA
View on GitHub
☆16Feb 10, 2026Updated 5 months ago
ga642381 / Taiwanese-Speech-Synthesis
View on GitHub
Taiwanese Speech Synthesis with Tacotron2
☆26Oct 2, 2022Updated 3 years ago
marcanuy / cedict_utils
View on GitHub
Parser for CC-CEDICT Chinese-English dictionary in Python
☆12Jul 25, 2023Updated 3 years ago
George0828Zhang / torch_cif
View on GitHub
A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…
☆37Feb 10, 2024Updated 2 years ago
JSALT-2022-SSL / superb-prosody
View on GitHub
☆31Jul 13, 2023Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
bojone / NNCWS
View on GitHub
Neutral Network based Chinese Segment System
☆19Nov 29, 2016Updated 9 years ago
my-yy / s2v_rc
View on GitHub
Speech2Vec Reality Check
☆88Feb 21, 2023Updated 3 years ago
cyhuang-tw / AutoVC
View on GitHub
An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".
☆34Apr 26, 2021Updated 5 years ago
MaikeZuefle / f-actor
View on GitHub
☆28Jul 17, 2026Updated last week
jurekkow / bert-squad-demo
View on GitHub
Demo web server app that shows how BERT model trained on SQuAD dataset deals with the machine comprehension task.
☆10Dec 8, 2022Updated 3 years ago
vectominist / spin
View on GitHub
Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…
☆65May 19, 2023Updated 3 years ago
Splend1d / T5lephone
View on GitHub
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆19Nov 29, 2022Updated 3 years ago
cyhuang-tw / AdaIN-VC
View on GitHub
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…
☆119May 27, 2021Updated 5 years ago
voidful / awesome-question-answering-dataset
View on GitHub
A list of awesome machine question answering dataset - 機器問答數據集
☆15Dec 24, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zzeng13 / DISC
View on GitHub
Automatic Idiomatic Expression Detection
☆13Sep 26, 2021Updated 4 years ago
voidful / wav2vec2-xlsr-multilingual-56
View on GitHub
56 language, 1 model Multilingual ASR
☆25Jul 25, 2021Updated 5 years ago
supafull / magnetic
View on GitHub
Demo/example with react-admin, electric-sql and local supabase with k3d/helm
☆11Jun 16, 2024Updated 2 years ago
sheng-z / cross-lingual-open-ie
View on GitHub
MT/IE: Cross-lingual Open Information Extraction with Neural Sequence-to-Sequence Models
☆23Jul 15, 2018Updated 8 years ago
biug / pkunlp
View on GitHub
pku nlp toolkit
☆10Jun 5, 2018Updated 8 years ago
skit-ai / slu-prosody
View on GitHub
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27May 17, 2023Updated 3 years ago
wangjin0818 / word_embedding_refine
View on GitHub
☆23Aug 13, 2018Updated 7 years ago
falcondai / chinese-char-lm
View on GitHub
explores Chinese language models with sub-character level visual information
☆16Oct 5, 2018Updated 7 years ago
robinvanschaik / interpret-flair
View on GitHub
A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.
☆27May 13, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
voidful / llm-codec
View on GitHub
LLM-Codec: Neural Audio Codec Meets Language Model Objectives
☆23May 3, 2026Updated 2 months ago
ga642381 / SpeechPrompt
View on GitHub
**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…
☆102Apr 10, 2025Updated last year
shenfei1010 / CyberCan
View on GitHub
CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…
☆12Aug 24, 2021Updated 4 years ago
asafamr / SymPatternWSI
View on GitHub
Word Sense Induction with neural Bi-language Models and symmetric patterns
☆12Aug 31, 2018Updated 7 years ago
tzuhsien / Voice-conversion-evaluation
View on GitHub
An evaluation toolkit for voice conversion models.
☆42Jul 11, 2021Updated 5 years ago
xjuspeech / YOLOPitch
View on GitHub
☆10Jun 11, 2024Updated 2 years ago
Makisuo / pglite-drizzle
View on GitHub
☆14Mar 9, 2025Updated last year