mafudge / datasetsLinks
☆18Updated 2 months ago
Alternatives and similar repositories for datasets
Users that are interested in datasets are comparing it to the libraries listed below
Sorting:
- Extracting LinkedIn comments from any post and export it to Excel file☆23Updated 6 years ago
- Relatively simple text classification powered by spaCy☆41Updated 9 years ago
- Brand disambiguator for tweets to differentiate e.g. Orange vs orange (brand vs foodstuff), using NLTK and scikit-learn☆57Updated 11 years ago
- Text similarity based on Word2Vec vectors.☆11Updated 8 years ago
- A guide on extracting entities from raw text in order to conduct social network analysis.☆21Updated 7 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- ☆32Updated 6 years ago
- ☆16Updated 4 years ago
- Code examples and data for the KiwiPyCon 2014 NLP tutorial☆39Updated 10 years ago
- ☆9Updated 6 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.☆17Updated 8 years ago
- Binding the GDELT universe in a Spark environment☆25Updated 2 years ago
- Spark NLP for Streamlit☆15Updated 3 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Updated 9 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering☆32Updated 4 years ago
- Tutorial repo for the article "ML in Production"☆30Updated 2 years ago
- Code for the paper "Benchmarking sentiment analysis methods for large-scale texts: A case for using continuum-scored words and word shift…☆16Updated 8 years ago
- Code and setup information for Introduction to Machine Learning with Spark☆12Updated 9 years ago
- Twitter user classification tutorial at PyCon France 2016☆21Updated last year
- Quora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training☆18Updated 6 years ago
- Natural Language Generation for Gramex applications.☆25Updated 2 years ago
- Augment IBM Watson Natural Language Understanding APIs with a configurable mechanism for text classification, uses Watson Studio.☆46Updated 6 years ago
- Business Data Analysis by HiPIC of CalStateLA☆20Updated 6 years ago
- An experiment of using the LDA machine learning algorithm to generate topics from documents and tag them with those topics☆17Updated 8 years ago
- Know your ML Score based on Sculley's paper☆34Updated 6 years ago
- ☆23Updated 5 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated last week
- Python library for advanced text mining☆69Updated 5 years ago
- Clustering analysis of one million tweets using scikit-learn, including basic benchmarking of various clustering algorithms☆36Updated 8 years ago