CrossRef / reddit-dump-experiment
Experimental extraction of DOI citation information from Reddit submission dump.
☆8Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for reddit-dump-experiment
- A pipeline for detecting novel information about entities from a stream of text, updating a knowledge base about the entities, and genera…☆32Updated 5 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Updated 6 years ago
- It finds best synonyms from Google Books when you press a hotkey☆30Updated 9 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- A comprehensive graph of mathematical domains and topics☆20Updated 2 years ago
- Training a classifier to reddit's TIL to find new things on Wikipedia☆35Updated 9 years ago
- Source code for my paper "Matrix Differential Calculus with Tensors (for Machine Learning)"☆12Updated 8 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs☆28Updated 9 years ago
- Build tables of information by extracting facts from indexed text corpora via a simple and effective query language.☆56Updated 5 years ago
- Code for morphological transformations☆29Updated 7 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- A machine learning software for extracting information from scholarly documents☆23Updated 3 years ago
- Publicly available data for Paperscape☆44Updated 6 years ago
- stav text annotation visualiser☆34Updated 13 years ago
- Fast Dot Products on Pretty Big Data☆15Updated 6 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- framework for making streamcorpus data☆11Updated 7 years ago
- Ranking Entity Types using the Web of Data☆30Updated 7 years ago
- Supervised learning for novelty detection in text☆79Updated 8 years ago
- Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly☆47Updated 4 years ago
- Infinite relational model (IRM) for datamicroscopes☆14Updated 9 years ago
- A tool for learning significant phrase/term models, and efficiently labeling with them.☆31Updated last year
- An implementation of word2vec applied to [stanford philosophy encyclopedia](http://plato.stanford.edu/)☆35Updated 8 years ago
- assorted text data☆34Updated 5 years ago
- A crowd-sourced dataset of 4800 Halloween costumes☆36Updated 7 years ago
- Easily identify and label sentence intervals using various taggers.☆16Updated 7 years ago
- two strange things to do with neural nets☆16Updated 5 years ago