parkervg / news-article-clustering
A document similarity project attempting to cluster news stories covering identical events.
☆26 Updated 4 years ago
Alternatives and similar repositories for news-article-clustering
Users who are interested in news-article-clustering are comparing it to the libraries listed below.
- A Python package that helps scrape news details from any news website ☆211 Updated last month
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how? ☆524 Updated 8 months ago
- This repository provides usage examples for the Python module Newspaper3k. ☆147 Updated last year
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se… ☆153 Updated last year
- GetOldTweets-Python is a project written in Python to mine old and backdated tweets. It bypasses some limitations/restrictions of the Twi… ☆126 Updated 2 years ago
- LexPredict Legal Dictionaries ☆119 Updated 2 years ago
- A Python utility for downloading Common Crawl data ☆242 Updated 2 years ago
- 📊 Semantic search for headlines and story text ☆360 Updated last year
- Text analysis with networks. ☆286 Updated 3 months ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python ☆272 Updated last year
- Open Source Thesaurus of Job Titles in US English ☆138 Updated 2 years ago
- A list of selected resources, methods, and tools dedicated to Legal Text Analytics. ☆663 Updated 8 months ago
- SpikeX - SpaCy Pipes for Knowledge Extraction ☆399 Updated 3 years ago
- BookNLP, a natural language processing pipeline for books ☆850 Updated 11 months ago
- LexNLP by LexPredict ☆736 Updated last year
- `scrape_linkedin` is a Python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into struct… ☆495 Updated 2 years ago
- Find legal citations in any block of text ☆160 Updated 2 weeks ago
- Analyze text with Empath ☆333 Updated 8 years ago
- Scrape news articles and analyze them using NLP to quantify the gender gap in Canadian mainstream media ☆43 Updated last year
- Source code for the Medium article "Extracting the author of news stories with DOM-based segmentation and BERT" ☆29 Updated 5 years ago
- CUAD (NeurIPS 2021) ☆440 Updated 2 years ago
- A spaCy pipeline and model for NLP on unstructured legal text. ☆655 Updated last year
- Ultimate Website Sitemap Parser ☆222 Updated 3 weeks ago
- Quote extraction for modular journalism (JournalismAI collab 2021) ☆229 Updated 3 years ago
- A multithreaded Pushshift.io API wrapper for reddit.com comment and submission searches. ☆221 Updated 2 years ago
- A Python library for calculating a large variety of metrics from text ☆341 Updated 7 months ago
- Scrape data from the Quora website: questions related to certain topics, answers given to certain questions, and user profile data ☆54 Updated 2 years ago
- Python Pushshift.io API Wrapper (for comment/submission search) ☆361 Updated 2 years ago
- A Python program to scrape Google's Knowledge Panels for details on a list of businesses ☆19 Updated 2 years ago
- A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational e… ☆900 Updated last year