pushshift / rinzlerLinks
A high performance indexing and search system for managing big data
☆17Updated 6 years ago
Alternatives and similar repositories for rinzler
Users that are interested in rinzler are comparing it to the libraries listed below
Sorting:
- Read compressed NDJSON .zst files easily☆32Updated 2 years ago
- Count-Min Tree Sketch: Approximate counting for NLP☆9Updated 7 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 7 years ago
- Script to extract highly probable bots for further analysis☆13Updated 7 years ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- Lightning Fast Language Prediction 🚀☆167Updated 6 years ago
- Hidden alignment conditional random field for classifying string pairs.☆36Updated 7 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆41Updated 5 years ago
- BottomK minwise hashing for streaming set similarity☆43Updated 6 years ago
- Text readability metrics in Python.☆11Updated 11 years ago
- 🔮 spaCy's Machine Learning library for NLP in Python☆8Updated 6 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- Tweets annotated with coarse-grained sense labels (supersenses)☆13Updated 10 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- A pure Python implementation of Aho-Corasick algorithm.☆22Updated 6 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- A tool for learning significant phrase/term models, and efficiently labeling with them.☆33Updated last month
- Indra is a Web Service which allows easy access to different distributional semantics models in several languages.☆48Updated last month
- Read and use word2vec vectors in Go☆56Updated 6 years ago
- Burglary prediction for mortals☆10Updated last year
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- Locality Sensitive Hashing using Golang and SQL database☆28Updated 8 years ago
- Minhash LSH in Golang☆25Updated 5 years ago
- Algorithms for URL Classification☆19Updated 10 years ago
- Python bindings to the Compact Language Detector☆33Updated 5 years ago
- A visualisation tool for Spacy using Hierplane.☆65Updated 2 years ago
- Bleve Extensions☆48Updated last year
- Yet Another (natural language) Parser☆43Updated 6 years ago
- ☆32Updated 4 years ago