pushshift / rinzler
A high performance indexing and search system for managing big data
☆17Updated 5 years ago
Related projects: ⓘ
- A workflow system for Natural Language Processing.☆21Updated 4 years ago
- Text readability metrics in Python.☆12Updated 11 years ago
- Read compressed NDJSON .zst files easily☆33Updated 2 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs☆28Updated 9 years ago
- Indra is a Web Service which allows easy access to different distributional semantics models in several languages.☆47Updated 3 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- A pipeline for detecting novel information about entities from a stream of text, updating a knowledge base about the entities, and genera…☆32Updated 5 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆42Updated 4 years ago
- Hidden alignment conditional random field for classifying string pairs.☆25Updated this week
- Hidden alignment conditional random field for classifying string pairs.☆37Updated 7 years ago
- Burglary prediction for mortals☆10Updated 3 months ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.☆21Updated 7 years ago
- Example how to pre-process news articles with textbox and index on Elastic Search☆13Updated 7 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- Code for morphological transformations☆29Updated 7 years ago
- A visualisation tool for Spacy using Hierplane.☆65Updated last year
- Script to extract highly probable bots for further analysis☆12Updated 7 years ago
- WordNet Domains, WordNet Affect and SentiWords☆49Updated 8 years ago
- Official details for: [1803.08493] Context is Everything: Finding Meaning Statistically in Semantic Spaces☆39Updated 5 years ago
- Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)☆25Updated 5 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 8 years ago
- ☆42Updated 7 years ago
- "Learning What is Essential in Questions", CoNLL, 2017☆26Updated 6 years ago
- Aho-Corasick string replacement utility☆23Updated 4 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Updated 6 years ago
- stav text annotation visualiser☆34Updated 12 years ago
- Python and pandas tools to perform various analyses on different types of word lists☆16Updated 9 years ago
- ☆38Updated this week
- A disk-based key/value store in Python with no dependencies.☆21Updated 9 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago