kotartemiy / topic-labeled-news-dataset
100k+ topic labeled news articles published from thousands of news websites
☆19Updated 4 years ago
Alternatives and similar repositories for topic-labeled-news-dataset:
Users that are interested in topic-labeled-news-dataset are comparing it to the libraries listed below
- Interpretable feature construction from taxonomies for text classification☆18Updated 3 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- ☆30Updated 2 years ago
- Meta-repository for the open-source version of the SUMMA Platform☆15Updated last year
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- LLM plugin for clustering embeddings☆75Updated last year
- Transforming textual descriptions into process models using deep learning☆14Updated 5 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- Jupyter notebook + Code for reproducing Reddit Subreddit graphs☆17Updated 8 years ago
- A set of tools to accelerate work in Jupyter notebooks.☆11Updated 5 years ago
- Integration between Reaction ECommerce and Accelerated Text to provide product descriptions for an e-shop.☆12Updated 4 years ago
- Exploration and charting of world income distribution☆12Updated 5 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- The projects lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques☆29Updated 4 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- A web application tagging and retrieval of arguments in text☆28Updated last year
- ☆11Updated 5 years ago
- Documentation effort for the BookCorpus dataset☆34Updated 3 years ago
- Python SDK for the TextRazor Text Analytics API☆20Updated last year
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 7 months ago
- Attempt to reconstruct daily google trends data over longer periods of time.☆10Updated 6 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- Text processing library for sentiment analysis and related tasks☆27Updated 6 years ago
- ☆9Updated 6 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆56Updated last year
- Natural Language Generation for Gramex applications.☆24Updated 2 years ago
- A curated list of ML awesome frameworks & libraries for text data☆16Updated 2 years ago
- Python package that offers text scrubbing functionality, providing building blocks for string cleaning as well as normalizing geographica…☆22Updated 8 months ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆38Updated 5 years ago
- A set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classification☆29Updated 3 months ago