Normalize text string
☆12Nov 6, 2018Updated 7 years ago
Alternatives and similar repositories for text-normalizer
Users that are interested in text-normalizer are comparing it to the libraries listed below
Sorting:
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- DEPRECATED: research attempt to build e2e task oriented chatbot optimized over conversational data and content of DB (single table)☆11Sep 28, 2016Updated 9 years ago
- zero-vocab or low-vocab embeddings☆18Jul 17, 2022Updated 3 years ago
- ☆15Jul 15, 2019Updated 6 years ago
- A Python Wrapper of Stanford Chinese Segmenter☆20Aug 2, 2017Updated 8 years ago
- Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings☆53Dec 6, 2016Updated 9 years ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆44Oct 10, 2025Updated 4 months ago
- Dataset and Codes for our EMNLP 2022 Main Conference Long Paper titled "ECTSum: A New Benchmark Dataset For Bullet Point Summarization of…☆32May 22, 2024Updated last year
- Deep-learning based sentence auto-segmentation from unstructured text w/o punctuation☆36May 14, 2017Updated 8 years ago
- RNNs for Text Normalization☆40Dec 12, 2017Updated 8 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Jan 26, 2020Updated 6 years ago
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- Implementing BERT + CRF with PyTorch for Chinese NER.☆10Mar 7, 2022Updated 3 years ago
- This repo contains all the cheatsheets that I found Important.☆10Oct 27, 2020Updated 5 years ago
- Source Code for "Improved Embeddings for Learning Prerequisite Chains" (CPSC 490 - Senior Project)☆11May 2, 2019Updated 6 years ago
- This is my 2024 course for TAP Institute on Vector Databases and Semantic Searching.☆12Jul 26, 2024Updated last year
- TSDG: An efficient index graph for graph-based nearest neighbor search☆10Jul 14, 2022Updated 3 years ago
- Redis distributed lock implementation for Python based on Pub/Sub messaging☆11Feb 14, 2026Updated 2 weeks ago
- rabitq rust implementation☆10Feb 4, 2026Updated 3 weeks ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- Character Based Named Entity Recognition.☆40Apr 3, 2018Updated 7 years ago
- ☆37Nov 22, 2025Updated 3 months ago
- python CRF++实现分词☆37Jun 19, 2018Updated 7 years ago
- Classify audio samples using a neural network☆10May 19, 2017Updated 8 years ago
- Extending Python's process pool to support asyncio functions☆12Sep 23, 2021Updated 4 years ago
- VAS-CRIU: Process checkpoint/restore with fast in-memory snapshotting of memory using MVAS.☆14Mar 6, 2018Updated 7 years ago
- Risk Minimization Algorithms in Structured Prediction (JMLR 2016)☆13Jan 26, 2017Updated 9 years ago
- Source code for "N-ary Constituent Tree Parsing with Recursive Semi-Markov Model" published at ACL 2021☆10May 27, 2021Updated 4 years ago
- A Python JIT compiler☆12May 29, 2019Updated 6 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- Portable wget for windows.☆11Aug 23, 2015Updated 10 years ago
- Simple setup for personal dotfiles☆11Nov 29, 2025Updated 3 months ago
- Deep Autoencoding Predictive Components☆10Mar 4, 2021Updated 4 years ago
- Interlinear glosses for pandoc☆10Feb 12, 2018Updated 8 years ago
- Music Line Bot powered by OLAMI and KKBOX Open API.☆11Dec 8, 2022Updated 3 years ago
- ☆11Apr 24, 2023Updated 2 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot…☆11Jan 26, 2021Updated 5 years ago
- ☆10Jun 5, 2025Updated 8 months ago