kotartemiy / topic-labeled-news-dataset
100k+ topic labeled news articles published from thousands of news websites
☆18Updated 4 years ago
Alternatives and similar repositories for topic-labeled-news-dataset:
Users that are interested in topic-labeled-news-dataset are comparing it to the libraries listed below
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- ☆30Updated 2 years ago
- Jupyter notebook + Code for reproducing Reddit Subreddit graphs☆16Updated 8 years ago
- Meta-repository for the open-source version of the SUMMA Platform☆16Updated 9 months ago
- A curated list of ML awesome frameworks & libraries for text data☆16Updated last year
- Jupyter notebook + Code for scraping AngelList data and making an interactive chart of SFBA salaries/equity☆14Updated 8 years ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- LLM plugin for models hosted by Anyscale Endpoints☆32Updated 8 months ago
- The projects lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques☆29Updated 4 years ago
- LLM plugin for clustering embeddings☆65Updated 10 months ago
- A web application tagging and retrieval of arguments in text☆29Updated last year
- Word embeddings for job postings☆13Updated 2 years ago
- Exploration and charting of world income distribution☆12Updated 5 years ago
- Statistical visualizations for Datasette using Seaborn☆11Updated 2 years ago
- Matrix-based News Aggregation to Explore Media Bias☆20Updated 6 years ago
- Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles☆23Updated 5 years ago
- A Foursquare data scraper that gathers all venues within a specified geographic area.☆39Updated 5 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- Track changes to GraphQL APIs by git scraping their schemas☆27Updated this week
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 2 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- Burglary prediction for mortals☆10Updated 7 months ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated last year
- Aho-Corasick string replacement utility☆24Updated 5 years ago
- Question Generation - Question Answering for Automatic Flashcards☆64Updated 2 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆15Updated last week