CrossRef / reddit-dump-experiment
Experimental extraction of DOI citation information from Reddit submission dump.
☆8Updated 9 years ago
Alternatives and similar repositories for reddit-dump-experiment:
Users that are interested in reddit-dump-experiment are comparing it to the libraries listed below
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- A pipeline for detecting novel information about entities from a stream of text, updating a knowledge base about the entities, and genera…☆32Updated 5 years ago
- A tool for detecting sentence fragments.☆7Updated 8 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Updated 7 years ago
- DKPro WSD: A Java framework for word sense disambiguation☆20Updated 2 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 2 years ago
- Perform basic NLP of popular subreddits to understand trending topics☆11Updated 9 years ago
- SmallK: very fast data clustering tools☆14Updated 5 years ago
- ☆21Updated 6 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- Supervised learning for novelty detection in text☆78Updated 8 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- An interactive map of reddit: the "front page of the internet"☆38Updated 9 years ago
- Build tables of information by extracting facts from indexed text corpora via a simple and effective query language.☆56Updated 5 years ago
- Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly☆47Updated 4 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs☆28Updated 9 years ago
- allennlp + streamlit demo☆22Updated 5 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary☆11Updated last year
- Entity Linking for the masses☆56Updated 9 years ago
- stav text annotation visualiser☆34Updated 13 years ago
- All the Harry Potter clusters you could ever want☆33Updated 9 years ago
- ☆28Updated 3 years ago
- Accompanying code for using hoverpy with scikitlearn☆10Updated 8 years ago
- bin files☆13Updated 3 weeks ago
- Training a classifier to reddit's TIL to find new things on Wikipedia☆35Updated 9 years ago
- A machine learning software for extracting information from scholarly documents☆23Updated 4 years ago
- framework for making streamcorpus data☆11Updated 7 years ago
- Find the essence☆108Updated 9 years ago
- An implementation of word2vec applied to [stanford philosophy encyclopedia](http://plato.stanford.edu/)☆35Updated 8 years ago
- IPython Magic for exporting pandas objects to Excel☆13Updated 7 years ago