DistrictDataLabs / minke
Graph extraction and NLP analysis for Baleen Corpora
☆18 Updated 8 years ago
Alternatives and similar repositories for minke:
Users who are interested in minke compare it to the libraries listed below.
- Multidimensional data explorer and visualization tool. ☆56 Updated 7 years ago
- A disk-based key/value store in Python with no dependencies. ☆21 Updated 9 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping ☆17 Updated 9 years ago
- Some convenient natural language tools that build on NLTK. ☆85 Updated 10 years ago
- An automated ingestion service for blogs to construct a corpus for NLP research. ☆87 Updated 6 years ago
- Python binding for gumbo-parser using Cython ☆14 Updated 8 years ago
- ☆24 Updated 6 years ago
- A python module that will check for package updates. ☆28 Updated 3 years ago
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors. ☆52 Updated 3 years ago
- Aho-Corasick string replacement utility ☆24 Updated 5 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe ☆25 Updated 4 years ago
- Demo code for learning_text_transformer ☆25 Updated 10 years ago
- Extract, parse and populate templates from strings ☆27 Updated 5 years ago
- Find which links on a web page are pagination links ☆29 Updated 8 years ago
- Generic Environment for Context-Aware Correction of Orthography ☆22 Updated 2 years ago
- Set-oriented Operations in Pandas ☆24 Updated 4 years ago
- Python wrapper for a C++ Double Metaphone ☆15 Updated 2 years ago
- Generate ipywidgets from Parameterized objects in the notebook ☆36 Updated 5 years ago
- This is an Object Oriented implementation of a Trie in python. The class contains setter and getter methods, and implements several usefu… ☆14 Updated 7 years ago
- Lightweight, multilingual natural language processing ☆63 Updated 11 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea ☆13 Updated 8 years ago
- AsyncIO serving for data science models ☆24 Updated 2 years ago
- A web application that identifies party in political discourse and an example of operationalized machine learning. ☆28 Updated 6 years ago
- A workflow system for Natural Language Processing. ☆21 Updated 5 years ago
- Hidden alignment conditional random field for classifying string pairs. ☆24 Updated 5 months ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text ☆35 Updated 8 years ago
- Traptor -- A distributed Twitter feed ☆26 Updated 2 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?) ☆36 Updated 10 years ago
- Building Python Data Application Tutorials ☆23 Updated 6 months ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want) ☆65 Updated last year