kotartemiy / topic-labeled-news-dataset
100k+ topic labeled news articles published from thousands of news websites
☆19, updated 4 years ago
Alternatives and similar repositories for topic-labeled-news-dataset:
Users who are interested in topic-labeled-news-dataset are comparing it to the repositories listed below.
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆41, updated 5 years ago
- Meta-repository for the open-source version of the SUMMA Platform ☆16, updated last year
- Virtual patent marking crawler at iproduct.epfl.ch ☆14, updated 7 years ago
- Interpretable feature construction from taxonomies for text classification ☆18, updated 2 years ago
- The project lets you automatically extract glossary words and their definitions from a given piece of text using NLP techniques ☆29, updated 4 years ago
- Tag news stories based on models trained on the NYT corpus. ☆42, updated 2 years ago
- Matrix-based News Aggregation to Explore Media Bias ☆20, updated 6 years ago
- A News Article Collection Library ☆22, updated last year
- A Flask webapp that categorizes Outlook emails using machine learning ☆15, updated 9 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts ☆59, updated 12 years ago
- Integration between Reaction ECommerce and Accelerated Text to provide product descriptions for an e-shop. ☆12, updated 4 years ago
- LLM plugin for models hosted by Anyscale Endpoints ☆33, updated 11 months ago
- A platform for collecting, analyzing, and visualizing social media data. ☆12, updated 4 years ago
- Jupyter notebook + Code for reproducing Reddit Subreddit graphs ☆17, updated 8 years ago
- A Streamlit application to visualize sentence embeddings ☆19, updated 2 years ago
- Find rss, atom, xml, and rdf feeds on webpages ☆30, updated 5 months ago
- Transforming textual descriptions into process models using deep learning ☆13, updated 5 years ago
- A curated list of ML awesome frameworks & libraries for text data ☆16, updated 2 years ago
- A set of tools to accelerate work in Jupyter notebooks. ☆11, updated 5 years ago
- A conda-smithy repository for spacy. ☆14, updated 3 months ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz… ☆40, updated 5 years ago
- Text readability metrics in Python. ☆11, updated 11 years ago
- Fastlaw's purpose is to replace generic word embeddings in supervised machine learning NLP tasks on legal texts. ☆37, updated 5 years ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight. ☆11, updated 4 years ago
- Exploits Wikipedia's daily view counts to find out which topics are currently trending ☆17, updated 11 years ago