mike0sv / Reuters-full-data-set
Full dataset of Reuters composed of 8,551,441 news titles, links and timestamps (Jan 2007 - Aug 2016).
☆22Updated 9 years ago
Alternatives and similar repositories for Reuters-full-data-set
Users that are interested in Reuters-full-data-set are comparing it to the libraries listed below
- An end-to-end event extraction and summarization system.☆22Updated 5 years ago
- The project lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques☆29Updated 5 years ago
- WordNet Domains, WordNet Affect and SentiWords☆48Updated 9 years ago
- 📄Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset☆21Updated 2 years ago
- A raspberry pi 64bit image with spacy and neuralcoref pre-installed☆21Updated 6 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆83Updated last year
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Extracting narrative timelines (i.e. order and timing of events) from text☆20Updated 6 years ago
- A set of tools for leveraging pre-trained embeddings, active learning and model explainability for efficient document classification☆29Updated 10 months ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆41Updated 6 years ago
- A news crawler for BBC News, Reuters and New York Times.☆127Updated 2 years ago
- Python implementation of MABED (Mention-Anomaly-Based Event Detection)☆38Updated 6 years ago
- Applying Snorkel to SuperGLUE☆26Updated 5 years ago
- Dynamic ensemble decoding with transformer-based models☆29Updated 2 years ago
- sumgram is a tool that summarizes a collection of text documents by generating the most frequent sumgrams (conjoined ngrams)☆56Updated last year
- Quill's library of open source NLP algorithms and data sets.☆52Updated last year
- Sentence embeddings for unsupervised event detection in the Twitter stream: study on English and French corpora☆31Updated 4 months ago
- Neural Elastic Inference and Search☆19Updated 6 years ago
- Similarity search on Wikipedia using gensim in Python.☆60Updated 6 years ago
- Model for learning document embeddings along with their uncertainties☆36Updated 2 years ago
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆30Updated 11 months ago
- Analyze and extract Wikipedia article text and attributes and store them into an ElasticSearch index or to json files (multilingual suppo…☆47Updated 2 years ago
- SENTiVENT: Company-specific event detection in economic news☆24Updated 7 years ago
- Classify a job description (or noisy job title) into an ONET job title☆19Updated 9 years ago
- Regex-like pattern matching on a sentence's parse tree instead of on strings☆42Updated 7 years ago
- Agents that build knowledge graphs and explore textual worlds by asking questions☆79Updated 2 years ago
- Tool for sentiment analysis annotation☆13Updated 8 months ago
- Deep Knowledge Extraction from Text☆38Updated 3 years ago
- GraphOfDocs: Representing multiple documents as a single graph☆20Updated 3 years ago
- Deep neural parser for database query☆18Updated 3 years ago