shenzhun / creating-enron-spam-corpus-from-raw-dataLinks
Using raw data of Enron spam datasets to create a corpus using python, nltk and shell script.
☆8Updated 11 years ago
Alternatives and similar repositories for creating-enron-spam-corpus-from-raw-data
Users that are interested in creating-enron-spam-corpus-from-raw-data are comparing it to the libraries listed below
Sorting:
- An Information Extraction Framework with Deep Learning developed at New York University☆15Updated 8 years ago
- In this project, there are two major tasks: text data processing and text categorization. In text data processing, we have done tokenizat…☆8Updated 8 years ago
- Recursive Neural Tensor Network for Semantic Role Labeling☆8Updated 9 years ago
- Large-scale topic discovery with Sampled-MinHashing☆10Updated 5 years ago
- Learning embeddings for transitive verb phrases☆12Updated 9 years ago
- An application of stacked denoising autoencoders to multi-modal (images and audio) abstract feature discovery☆12Updated 11 years ago
- A TensorFlow implementation of dependency-based word embeddings (dependency-based word2vec)☆11Updated 9 years ago
- Tensorflow code to train and predict geolocation of tweets; network also produces compact representation of tweets for hashing purposes☆16Updated 3 months ago
- Embeddings for n-grams☆11Updated 6 years ago
- A Latent Dirichlet Allocation topic modeling package based on SparseLDA Gibbs Sampling inference algorithm☆8Updated 12 years ago
- Experiments on english wikipedia. GloVe and word2vec.☆13Updated 9 years ago
- Quick-Data-Science-Experiments☆19Updated 7 years ago
- An active annotation tool based on brat(https://github.com/nlplab/brat)☆19Updated 7 years ago
- Introduction Notebook to Extreme Multi-Label Classification problem (XML)☆22Updated 6 years ago
- Resources for deep learning: papers, articles, courses☆10Updated 5 years ago
- An online learning perceptron benchmark for Kaggle movie review competition☆25Updated 9 years ago
- A tutorial on how to build your own Neural Language Model☆10Updated 2 years ago
- An implementation of gibbs sampling for Latent Dirichlet Allocation☆30Updated 13 years ago
- A set of tools and experimental scripts used to achieve multimodal learning with nonnegative matrix factorization (NMF).☆18Updated 8 years ago
- A toolkit for generating paraphrase vector representations for words in context☆23Updated 10 years ago
- Experiment code for AAAI paper: A Neural Probabilistic Model for Context Based Citation Recommendation☆9Updated 7 years ago
- Prediction model for Kaggle/Rossmann competition.☆13Updated 9 years ago
- Language Modelling, CMI vs Perplexity☆11Updated 7 years ago
- Benchmarks for Kaggle's Predict Closed Questions on Stack Overflow competition☆55Updated 9 years ago
- ☆18Updated 7 years ago
- Jupyter Notebook presentation for class imbalance in binary classification☆48Updated 6 years ago
- *SEM 2018: Learning Distributed Event Representations with a Multi-Task Approach☆21Updated 6 years ago
- Deep Generative Stochastic Networks for Sequence Prediction☆8Updated 2 years ago
- ☆12Updated 8 years ago
- Gibbs sampler for for a Naive Bayes document classifier☆24Updated 12 years ago