mammothb / editdistpy
Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) distance.
☆23 · Updated last week
Alternatives and similar repositories for editdistpy
Users who are interested in editdistpy are comparing it to the libraries listed below.
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 · Updated 2 years ago
- Source code for the Apple reproduction ☆32 · Updated 4 years ago
- ☆28 · Updated 2 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python. ☆21 · Updated last year
- Execute arbitrary SQL queries on 🤗 Datasets ☆32 · Updated last year
- TorchServe+Streamlit for easily serving your HuggingFace NER models ☆33 · Updated 2 years ago
- A set of methods for finding an appropriate number of topics in a text collection ☆16 · Updated last month
- Combining encoder-based language models ☆11 · Updated 3 years ago
- Repository for Findings of EMNLP 2020 "Context-aware Stand-alone Neural Spelling Correction" ☆18 · Updated 4 years ago
- A simple neural truecaser written in pytorch and allennlp. ☆33 · Updated 11 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 · Updated 2 years ago
- Self-contained Python package for OpenFst ☆51 · Updated 2 years ago
- Tooling to play around with multilingual machine translation for Indian Languages. ☆22 · Updated 3 years ago
- ☆17 · Updated last year
- GrammarTagger: A Neural Multilingual Grammar Profiler for Language Learning ☆27 · Updated 4 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection ☆15 · Updated 3 years ago
- zero-vocab or low-vocab embeddings ☆18 · Updated 2 years ago
- Large-scale query-focused multi-document Summarization dataset ☆10 · Updated 3 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and … ☆51 · Updated 6 months ago
- Implementation of pQRNN in PyTorch ☆46 · Updated 3 years ago
- All my experiments with the various transformers and various transformer frameworks available ☆14 · Updated 4 years ago
- Dataset of sentences from Hindi stories tagged with different emotion tags ☆11 · Updated 5 years ago
- Neural network sequence labeling model ☆11 · Updated 5 years ago
- Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages ☆9 · Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆24 · Updated 4 years ago
- Library for fast text representation and classification. ☆28 · Updated last year
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst… ☆22 · Updated 3 years ago
- Rust python bindings for symspell ☆19 · Updated last year
- ☆43 · Updated 2 years ago
- Robust Cross-lingual Embeddings from Parallel Sentences ☆22 · Updated 4 years ago