mike0sv / Reuters-full-data-setLinks
Full dataset of Reuters composed of 8,551,441 news titles, links and timestamps (Jan 2007 - Aug 2016).
β22Updated 9 years ago
Alternatives and similar repositories for Reuters-full-data-set
Users that are interested in Reuters-full-data-set are comparing it to the libraries listed below
Sorting:
- An end-to-end event extraction and summarization system.β22Updated 5 years ago
- πNeural Sentential Paraphrase Generation to Augment Chatbot Training Datasetβ21Updated 3 years ago
- The projects lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniquesβ29Updated 5 years ago
- Extracting narrative timelines (i.e. order and timing of events) from textβ20Updated 6 years ago
- WordNet Domains, WordNet Affect and SentiWordsβ48Updated 10 years ago
- Neural Elastic Inference and Searchβ19Updated 6 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around tβ¦β34Updated 2 years ago
- Python library for advanced text miningβ69Updated 5 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summarizβ¦β41Updated 6 years ago
- SENTiVENT: Company-specific event detection in economic newsβ24Updated 7 years ago
- tools to analyze a collection of texts and identify relevant wordsβ12Updated 7 years ago
- Analyze and extract Wikipedia article text and attributes and store them into an ElasticSearch index or to json files (multilingual suppoβ¦β47Updated 2 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trendsβ58Updated last year
- A raspberry pi 64bit image with spacy and neuralcoref pre-installedβ21Updated 6 years ago
- Similarity search on Wikipedia using gensim in Python.β60Updated 7 years ago
- A news crawler for BBC News, Reuters and New York Times.β129Updated 3 years ago
- Tensorflow Recurrent Neural Network (RNN) model to analyse Time Series in GDELT News dataset to predict future events.β30Updated 8 years ago
- Discover relevant information about categorical data with entity embeddings using Neural Networks (powered by Keras)β70Updated 3 years ago
- A framework for the analysis of social interaction networks (e.g. induced by Twitter mentions) in time.β61Updated 9 years ago
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs botβ¦β11Updated 4 years ago
- This repository for Web Crawling, Information Extraction, and Knowledge Graph build up.β33Updated 7 years ago
- Python 3 implementation and documentation of the Hermina-Janos local graph clustering algorithm.β24Updated 3 years ago
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extractionβ24Updated 3 years ago
- Tool for sentiment analysis annotationβ13Updated 9 months ago
- A set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classificationβ29Updated 11 months ago
- OKR: A Consolidated Open Knowledge Representation for Multiple Textsβ41Updated 7 years ago
- Agents that build knowledge graphs and explore textual worlds by asking questionsβ79Updated 2 years ago
- Paraphrase Generation model using pair-wise discriminator lossβ46Updated 4 years ago
- classify a job description (or noisy job title) into a ONET job titleβ19Updated 9 years ago
- Deep Knowledge Extraction from Textβ38Updated 3 years ago