kerighan / eldar
Boolean text search in Python
☆44 · Updated last year
Related projects:
- Use Hugging Face text and token classification pipelines directly in spaCy ☆61 · Updated 6 months ago
- Asent is a Python library for performing efficient and transparent sentiment analysis using spaCy. ☆114 · Updated 5 months ago
- RaKUn 2.0 - A fast keyword detection algorithm ☆61 · Updated last month
- An open-source package for Python to clean raw text data ☆68 · Updated last year
- spaCy wrapper for ConceptNet ☆88 · Updated last year
- Clean, filter and sample URLs to optimize data collection - Python & command-line - Deduplication, spam, content and language filters ☆113 · Updated 2 weeks ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆21 · Updated last year
- ☆65 · Updated 2 years ago
- In-the-wild extraction of entities that are found using Flair and displayed using a very elegant front-end. ☆69 · Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions ☆53 · Updated last year
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by The Incorporated… ☆25 · Updated last year
- Fast and robust date extraction from web pages, with Python or on the command-line ☆118 · Updated 2 weeks ago
- ☆53 · Updated 8 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx-like syntax. ☆57 · Updated 4 months ago
- Language detection using spaCy and fastText ☆53 · Updated 9 months ago
- Source code and data for Like a Good Nearest Neighbor ☆28 · Updated 7 months ago
- Recon NER: debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 · Updated 6 months ago
- A Streamlit component for annotating text by selecting text. ☆39 · Updated 3 months ago
- Python package for deduplication/entity resolution using active learning ☆77 · Updated 3 weeks ago
- Sentence transformers models for spaCy ☆104 · Updated last year
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t… ☆31 · Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆42 · Updated 5 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Hugging Face in your spaCy pipeline allowing you to i… ☆46 · Updated 5 months ago
- This repository contains an easy and intuitive approach to using SetFit in combination with spaCy. ☆71 · Updated last year
- Information extraction from English and German texts based on predicate logic ☆133 · Updated last year
- This is a prototype of a multilingual suite for named-entity recognition in Python. ☆21 · Updated 4 months ago
- XAI-based human-in-the-loop framework for automatic rule-learning. ☆47 · Updated 2 months ago
- spaCy match and replace, maintaining conjugation ☆34 · Updated last year
- Python API for https://vespa.ai, the open big data serving engine ☆89 · Updated this week
- A Python package to simulate typographical errors. ☆30 · Updated 9 months ago