kotartemiy / topic-labeled-news-dataset
100k+ topic-labeled news articles published by thousands of news websites
☆18 Updated 4 years ago
Alternatives and similar repositories for topic-labeled-news-dataset
Users who are interested in topic-labeled-news-dataset are comparing it to the repositories listed below:
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆41 Updated 5 years ago
- Matrix-based News Aggregation to Explore Media Bias ☆20 Updated 6 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values … ☆15 Updated 6 years ago
- Integration between Reaction ECommerce and Accelerated Text to provide product descriptions for an e-shop. ☆9 Updated 3 years ago
- Meta-repository for the open-source version of the SUMMA Platform ☆16 Updated 10 months ago
- Jupyter notebook + Code for reproducing Reddit Subreddit graphs ☆16 Updated 8 years ago
- ☆12 Updated 5 years ago
- A web application for tagging and retrieving arguments in text ☆29 Updated last year
- Natural Language Generation for Gramex applications. ☆24 Updated 2 years ago
- A set of tools to accelerate work in Jupyter notebooks. ☆11 Updated 4 years ago
- Virtual patent marking crawler at iproduct.epfl.ch ☆14 Updated 7 years ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight. ☆11 Updated 4 years ago
- Documentation effort for the BookCorpus dataset ☆33 Updated 3 years ago
- A utility for labeling clusters of text data. ☆28 Updated 3 years ago
- Extract statistics from Wikipedia Dump files. ☆26 Updated 3 years ago
- A financial disclosure data extraction tool. ☆13 Updated last year
- RxNLP APIs for clustering sentences, extracting topics, counting words & n-grams, extracting text from html or URL, computing similarity … ☆15 Updated 5 years ago
- Relational NLP: Convert text into relational facts. ☆9 Updated 5 years ago
- Aviation grade news article metadata extraction ☆36 Updated last year
- R code needed to reproduce Relationship between Reddit Comment Score and Comment Length for 1.66 Billion Comments visualization ☆18 Updated 9 years ago
- REST API for Text Summarization and Keywords Extraction ☆16 Updated 2 years ago
- A Flask webapp that categorizes Outlook emails using machine learning ☆15 Updated 9 years ago
- Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles ☆23 Updated 5 years ago
- Chrome Extension for exploring Hugging Face datasets 🔎 ☆49 Updated 5 months ago
- Jupyter notebook + Code for scraping AngelList data and making an interactive chart of SFBA salaries/equity ☆14 Updated 8 years ago
- A News Article Collection Library ☆22 Updated last year
- LLM plugin for models hosted by Anyscale Endpoints ☆32 Updated 9 months ago
- Web crawler for Burplist, a search engine for craft beers in Singapore ☆14 Updated this week
- Transforming textual descriptions into process models using deep learning ☆13 Updated 5 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023 ☆19 Updated last year