nreimers / truecaserView external linksLinks
Language independent truecaser in Python.
☆160Oct 17, 2021Updated 4 years ago
Alternatives and similar repositories for truecaser
Users that are interested in truecaser are comparing it to the libraries listed below
Sorting:
- A python true casing utility that restores case information for texts☆88Nov 15, 2022Updated 3 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Jun 17, 2024Updated last year
- Python port of Moses tokenizer, truecaser and normalizer☆495Feb 6, 2026Updated last week
- ☆10Feb 2, 2021Updated 5 years ago
- A web application tagging and retrieval of arguments in text☆29May 1, 2023Updated 2 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Apr 8, 2016Updated 9 years ago
- Expletives vomiting library...☆13Apr 17, 2017Updated 8 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated last year
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Tool for manual evaluation of parallel sentences.☆15Jan 26, 2026Updated 2 weeks ago
- The distributed statistical machine translation infrastructure consisting of load balancing, text pre/post-processing and translation ser…☆12Nov 29, 2018Updated 7 years ago
- Character Based Named Entity Recognition.☆40Apr 3, 2018Updated 7 years ago
- This is the text partitioner project for Python.☆21Dec 11, 2018Updated 7 years ago
- Code for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.☆43Jul 14, 2022Updated 3 years ago
- spaCy pipeline object for negating concepts in text☆282Jun 16, 2025Updated 7 months ago
- Query-Document Relevance☆42Feb 6, 2015Updated 11 years ago
- Efficient Low-Memory Aligner☆146Jan 15, 2025Updated last year
- Spoken Language Translation System☆14Jun 25, 2019Updated 6 years ago
- Download and load spaCy models on-the-fly☆15Feb 9, 2023Updated 3 years ago
- Python library for advanced text mining☆69Apr 11, 2020Updated 5 years ago
- c++ mosestokenizer☆18Mar 13, 2024Updated last year
- This repository contains all manually labeled data from the GermEval-2018 shared task.☆29Sep 28, 2018Updated 7 years ago
- framework for doing NER and other types of entity recognition, in Python☆68Jun 21, 2022Updated 3 years ago
- ☆31Mar 8, 2017Updated 8 years ago
- bilingual dictionary extractor from parallel corpora☆23Jul 3, 2014Updated 11 years ago
- A toolkit for generating paraphrase vector representations for words in context☆23May 19, 2015Updated 10 years ago
- Named Entity Recognition based on dictionaries☆241Mar 3, 2019Updated 6 years ago
- Server/Client around Spacy to load spacy only once☆46Jan 17, 2018Updated 8 years ago
- A collection of selected of models built with AllenNLP.☆25Feb 20, 2020Updated 5 years ago
- Fast Word Clustering Software☆79Feb 8, 2025Updated last year
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆900Aug 20, 2024Updated last year
- PredPatt: Predicate-Argument Extraction from Universal Dependencies☆111Feb 24, 2021Updated 4 years ago
- Project OCELoT: an Open, Collaborative Evaluation Leaderboard of Translations☆23Nov 5, 2025Updated 3 months ago
- spaCy-to-naf converter☆21Jun 10, 2025Updated 8 months ago
- Unsupervised multilingual sentence segmentation.☆21Feb 26, 2021Updated 4 years ago
- Dict2vec is a framework to learn word embeddings using lexical dictionaries.☆115Jan 8, 2021Updated 5 years ago
- Bag of, not words, but tricks!☆68Oct 31, 2023Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Jan 5, 2022Updated 4 years ago
- Transition-based NER system☆35Jun 22, 2018Updated 7 years ago