kerighan / eldarLinks
Boolean text search in Python
☆46Updated 6 months ago
Alternatives and similar repositories for eldar
Users that are interested in eldar are comparing it to the libraries listed below
Sorting:
- Fast and robust date extraction from web pages, with Python or on the command-line☆142Updated 2 months ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆120Updated 2 months ago
- An open-source package for python to clean raw text data☆74Updated 2 years ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆156Updated 3 weeks ago
- ☆68Updated 3 years ago
- Python package for deduplication/entity resolution using active learning☆83Updated last year
- Faster, modernized fork of the language identification tool langid.py☆61Updated last year
- RaKUn 2.0 - A fast keyword detection algorithm☆69Updated 5 months ago
- Sentence transformers models for SpaCy☆109Updated 2 years ago
- Concept Modeling: Topic Modeling on Images and Text☆217Updated last year
- Use ML-Annotate to label data for machine learning purposes☆110Updated 5 years ago
- 80x faster and 95% accurate language identification with Fasttext☆163Updated last year
- Blazing fast fuzzy text search for Python.☆51Updated 8 months ago
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆65Updated 11 months ago
- Powerful topic model visualization in Python☆139Updated 9 months ago
- ☆55Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆105Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated last week
- HDBSCAN Tuning for BERTopic Models☆49Updated 2 years ago
- A Streamlit component for annotating text by text selecting.☆42Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Confection: the sweetest config system for Python☆192Updated 3 weeks ago
- Measure the readability of a given text using surface characteristics☆81Updated 11 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆155Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Updated 2 years ago
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆156Updated 5 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆26Updated last month
- Extract text from HTML☆135Updated 5 years ago
- spaCy match and replace, maintaining conjugation☆36Updated 3 years ago
- ☆30Updated 3 years ago