hartwork / surrogatesLinks
Encode and decode pairs of surrogate characters in Python 3
☆10Updated 3 years ago
Alternatives and similar repositories for surrogates
Users that are interested in surrogates are comparing it to the libraries listed below
Sorting:
- Toolkit for domain-specific information retrieval experimentation☆19Updated 3 months ago
- An efficient algorithm for k-bounded (Damerau-)Levenshtein distance☆16Updated 7 years ago
- Train a model, and detect gibberish strings with it.☆67Updated 3 years ago
- Python requirements compilation☆14Updated last month
- The Keep It Simple Software Bill of Material☆11Updated 3 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated last year
- Python APTED algorithm for the Tree Edit Distance☆98Updated 7 years ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Updated 2 years ago
- Gather module dependencies of source code☆12Updated 2 years ago
- Python module (C extension and plain python) implementing DAWG☆20Updated 3 years ago
- Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm☆41Updated 2 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆75Updated 2 weeks ago
- Library for fast text representation and classification.☆31Updated last year
- Service to scan licenses from source code☆12Updated 2 years ago
- Build and upload fastText Python wheels to PyPI☆26Updated last year
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆67Updated 2 years ago
- The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question IntentionClassification Benchmark for Te…☆32Updated 4 years ago
- super fast cpp implementation of longest common subsequence/substring☆72Updated last year
- A Python interface to PISA☆36Updated 3 weeks ago
- An index data structure for approximate string search.☆23Updated 6 years ago
- The Continuous Clearing Tool scans and collects the 3rd party OSS components used in a NPM/NuGet/Debian/Maven/Python/Conan/Aipine project…☆29Updated this week
- Scripts as a service. Builds on systemd (for Linux)☆21Updated 2 years ago
- A Python 3 module that provides functions for splitting identifiers found in source code files.☆48Updated 2 years ago
- Targetted language identifier, based on FastText and Hunspell.☆37Updated last month
- Techniques used to run BLOOM at inference in parallel☆37Updated 2 years ago
- A utility to split tarballs into smaller pieces while keeping files intact.☆18Updated 3 years ago
- CyDifflib is a fast implementation of difflib's algorithms, which can be used as a drop-in replacement.☆29Updated 6 months ago
- SQuARE: Software for question answering research.☆75Updated last year
- A Python library for writing JSON documents as streams☆18Updated last year
- This is a mapping of CPEs to package urls created by using VulnerableCode's data☆10Updated 5 years ago