kerighan / eldarLinks
Boolean text search in Python
☆46Updated 7 months ago
Alternatives and similar repositories for eldar
Users that are interested in eldar are comparing it to the libraries listed below
Sorting:
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆120Updated 3 months ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆144Updated 2 months ago
- Concept Modeling: Topic Modeling on Images and Text☆217Updated last year
- Python package for deduplication/entity resolution using active learning☆83Updated last year
- An open-source package for python to clean raw text data☆74Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Next-generation Punkt sentence boundary detection with zero dependencies☆28Updated 2 months ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆158Updated last month
- an experimental implementation of Burrow's delta in Python 3☆21Updated 4 years ago
- A python package to simulate typographical errors.☆38Updated 2 years ago
- HDBSCAN Tuning for BERTopic Models☆50Updated 2 years ago
- RaKUn 2.0 - A fast keyword detection algorithm☆70Updated 5 months ago
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated last month
- ☆68Updated 3 years ago
- Information extraction from English and German texts based on predicate logic☆141Updated 2 years ago
- 🔢 Work with static vector models☆36Updated 9 months ago
- Blazing fast fuzzy text search for Python.☆51Updated 9 months ago
- Simply, faster, sentence-transformers☆143Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆170Updated 3 years ago
- A Streamlit component for annotating text by text selecting.☆42Updated last year
- Sentence transformers models for SpaCy☆109Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆30Updated last year
- ☆55Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆105Updated last year
- Faster, modernized fork of the language identification tool langid.py☆60Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆105Updated last year
- spaCy match and replace, maintaining conjugation☆36Updated 3 years ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆65Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Updated 2 years ago
- Measure the readability of a given text using surface characteristics☆81Updated last year