kerighan / eldar
Boolean text search in Python
☆44Updated 2 years ago
Alternatives and similar repositories for eldar:
Users that are interested in eldar are comparing it to the libraries listed below
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆32Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated last year
- Measure the readability of a given text using surface characteristics☆74Updated 2 years ago
- ☆54Updated last year
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆60Updated this week
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- an experimental implementation of Burrow's delta in Python 3☆20Updated 3 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆37Updated 5 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆89Updated last year
- A BERT-based application for reusable text classification at scale☆37Updated last year
- Small python package to measure OCR quality and other related metrics.☆21Updated 11 months ago
- Python package for deduplication/entity resolution using active learning☆78Updated 5 months ago
- Quote identification, attribution and resolution.☆12Updated last year
- A News Article Collection Library☆22Updated last year
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper.☆12Updated last year
- The News Landscape Toolkit (NELA)☆15Updated 4 years ago
- ☆13Updated 3 years ago
- RaKUn 2.0 - A fast keyword detection algorithm☆64Updated this week
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated 10 months ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆121Updated last month
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated 8 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 11 months ago
- ☆67Updated 10 months ago
- Python API for https://vespa.ai, the open big data serving engine☆113Updated this week
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- Extract dates from text☆64Updated 4 years ago
- ☆42Updated last year
- A python package to simulate typographical errors.☆31Updated last year