dsynkov / newspaper-bulk
CLI to extract article contents in bulk using Newspaper3k and multithreading.
☆13Updated 6 years ago
Related projects: ⓘ
- A simple Flask & React app to demonstrate how to generate text with OpenAI's GPT-2☆52Updated last year
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 4 years ago
- A raspberry pi 64bit image with spacy and neuralcoref pre-installed☆21Updated 4 years ago
- Finds linguistic patterns effortlessly☆31Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆25Updated last year
- Visualize large text collections with WebGL☆25Updated 2 weeks ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 5 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆36Updated 5 years ago
- Python SDK for the TextRazor Text Analytics API☆20Updated last year
- semantically distinct key phrase extraction using hilbert hashes.☆46Updated 2 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆87Updated 2 years ago
- A TextBlob sentiment analysis pipeline component for spaCy.☆54Updated 2 years ago
- A Python package to get useful information from documents using TopicRank Algorithm.☆16Updated last year
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago
- Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles☆23Updated 5 years ago
- Topic modelling with SpaCy, Gensim and Textacy☆19Updated 6 years ago
- A visualisation tool for Spacy using Hierplane.☆65Updated last year
- GraphiPy: Universal Social Data Extractor☆79Updated last year
- A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API.☆114Updated 4 years ago
- Dataframe Integration with spaCy.☆100Updated 3 years ago
- Language detection using Spacy and Fasttext☆53Updated 9 months ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆31Updated last year
- A suite of tools for collecting, pre-processing, analyzing and sentiment-scoring twitter data☆23Updated 3 years ago
- A spaCy wrapper for DBpedia Spotlight☆103Updated last year
- Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence☆62Updated 11 months ago
- Notebooks configured to be run with Binder, usually found on my blog.☆41Updated last year
- ☆32Updated 10 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆90Updated last year
- Python based Wikidata framework for easy dataframe extraction☆39Updated 9 months ago