pushshift / rinzler
A high performance indexing and search system for managing big data
☆17Updated 6 years ago
Alternatives and similar repositories for rinzler
Users that are interested in rinzler are comparing it to the libraries listed below
Sorting:
- Read compressed NDJSON .zst files easily☆32Updated 2 years ago
- Measure the similarity of text corpora for 74 languages☆13Updated last year
- Fast and customizable tokenization☆64Updated 5 years ago
- Count-Min Tree Sketch: Approximate counting for NLP☆10Updated 7 years ago
- Text readability metrics in Python.☆11Updated 11 years ago
- A streaming ETL for fish☆13Updated 6 years ago
- BottomK minwise hashing for streaming set similarity☆43Updated 6 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- Read and use word2vec vectors in Go☆56Updated 6 years ago
- Indra is a Web Service which allows easy access to different distributional semantics models in several languages.☆48Updated last month
- Logical backup and restore of a tarantool instance.☆12Updated 3 years ago
- Fast identification of character sequences in text or documents (multi-lingual)☆18Updated 9 years ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- Socially-Equitable Language Identification☆78Updated 2 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆41Updated 5 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- Python bindings for the fast integer compression library FastPFor.☆58Updated last year
- Natural language generation language☆56Updated 6 years ago
- Hidden alignment conditional random field for classifying string pairs.☆36Updated 7 years ago
- alvd = A Lightweight Vald. A lightweight distributed vector search engine works without K8s.☆49Updated 3 years ago
- Official details for: [1803.08493] Context is Everything: Finding Meaning Statistically in Semantic Spaces☆39Updated 5 years ago
- Lightning Fast Language Prediction 🚀☆166Updated 6 years ago
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- A Large Automatically-Constructed Resource of Predicate Paraphrases☆45Updated 5 years ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- ☆10Updated 5 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 4 years ago
- Python library to work with ConceptNet offline☆10Updated 2 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 7 years ago
- A simple library for loading word2vec binary model.☆12Updated 9 years ago