kerighan / eldar
Boolean text search in Python
☆45Updated 2 years ago
Alternatives and similar repositories for eldar:
Users that are interested in eldar are comparing it to the libraries listed below
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆135Updated 2 months ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- An open-source package for python to clean raw text data☆69Updated last year
- Blazing fast fuzzy text search for Python.☆42Updated last month
- 💫 SpaCy wrapper for ConceptNet 💫☆90Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated 10 months ago
- an experimental implementation of Burrow's delta in Python 3☆21Updated 3 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆69Updated last month
- A News Article Collection Library☆22Updated last year
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆60Updated last week
- Fast and robust date extraction from web pages, with Python or on the command-line☆124Updated 2 months ago
- ☆30Updated 2 years ago
- ☆68Updated 2 years ago
- Python package for deduplication/entity resolution using active learning☆76Updated 6 months ago
- XAI based human-in-the-loop framework for automatic rule-learning.☆48Updated 7 months ago
- RaKUn 2.0 - A fast keyword detection algorithm☆65Updated last week
- A BERT-based application for reusable text classification at scale☆38Updated last year
- ☆54Updated last year
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆37Updated 5 years ago
- spaCy entry points for Curated Transformers☆27Updated 5 months ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Tools for interactive visual exploration of semantic embeddings.☆30Updated 5 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆157Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated last year
- Pre-train Static Word Embeddings☆47Updated last month
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆76Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆96Updated 10 months ago
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do…☆48Updated 11 months ago