vu3jej / scrapy-corenlp
☆59Updated 3 years ago
Alternatives and similar repositories for scrapy-corenlp:
Users who are interested in scrapy-corenlp are comparing it to the libraries listed below.
- Python interface to the Stanford Named Entity Recognizer☆291Updated 3 years ago
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.…☆82Updated 2 years ago
- Predict age and gender from a first name☆60Updated 6 years ago
- ☆43Updated 9 years ago
- A python tool for collecting tweets in mongoDB using the search API☆80Updated last year
- An introduction to using spaCy for NLP and machine learning☆191Updated 2 years ago
- 💫 Scripts, tools and resources for developing spaCy☆125Updated 5 years ago
- Code for NLTK3 Cookbook☆141Updated 8 years ago
- Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit"☆110Updated 10 years ago
- HackDelft☆81Updated 7 years ago
- Simple practice for text classification using Python☆58Updated 10 years ago
- clone of https://code.google.com/p/splitta/ so it can be a git submodule☆34Updated 11 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 4 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- Reduction is a python script which automatically summarizes a text by extracting the sentences which are deemed to be most important.☆55Updated 9 years ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- Python 2 & 3 wrapper around the Stanford Topic Modeling Toolbox. Intended to be used for hassle-free supervised topic classification with…☆59Updated 6 years ago
- Extract countries, regions and cities from a URL or text☆218Updated 4 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 4 months ago
- Scrapes sites. Gets news. Eventually events.☆84Updated 8 years ago
- Twitter visualization experiment using various python-based technologies.☆60Updated 8 years ago
- Thin wrapper for the Microsoft Cognitive Services☆59Updated 7 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 6 years ago
- Automatic News Corpus Builder☆40Updated 7 years ago
- An automated ingestion service for blogs to construct a corpus for NLP research.☆86Updated 6 years ago
- A Topic Modeling toolbox☆92Updated 8 years ago
- A python implementation of DEPTA☆83Updated 8 years ago
- 💥 Browser-based slides or PDFs of our talks and presentations☆94Updated 6 years ago
- For extracting measurements and related entities from text☆57Updated 4 years ago