mafudge / datasets
☆18Updated 3 weeks ago
Alternatives and similar repositories for datasets
Users that are interested in datasets are comparing it to the libraries listed below
Sorting:
- Extracting LinkedIn comments from any post and export it to Excel file☆23Updated 6 years ago
- Labs and data files for a full-day Spark workshop☆24Updated last year
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- Deploy sentiment analysis using Flask☆17Updated 5 years ago
- Relatively simple text classification powered by spaCy☆41Updated 9 years ago
- ☆11Updated 6 years ago
- Spark NLP for Streamlit☆15Updated 3 years ago
- Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.☆17Updated 8 years ago
- ☆20Updated 8 years ago
- Workshop materials for scraping Twitter with Python☆13Updated 8 years ago
- Tutorial repo for the article "ML in Production"☆30Updated 2 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- Business Data Analysis by HiPIC of CalStateLA☆20Updated 6 years ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆12Updated 3 months ago
- Companion code for my video course on Practical Python Data Science Techniques, published by Packt Publishing☆33Updated 7 years ago
- Brand disambiguator for tweets to differentiate e.g. Orange vs orange (brand vs foodstuff), using NLTK and scikit-learn☆57Updated 11 years ago
- Text similarity based on Word2Vec vectors.☆11Updated 8 years ago
- Different APIs for text analytics and SEMANTIC ANALYSIS using machine learning☆15Updated 8 years ago
- ☆15Updated 6 years ago
- Python library for efficient multi-threaded data processing, with the support for out-of-memory datasets.☆27Updated 5 years ago
- Sample techniques for a variety of feature extraction methods☆31Updated 4 years ago
- ☆40Updated 7 years ago
- Tutorial code and data for the entity resolution workshops.☆45Updated 9 years ago
- Very basic introduction to pyspark☆15Updated 8 years ago
- ☆23Updated 5 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- ☆11Updated 6 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 7 months ago
- UMD Course on Computational Journalism☆25Updated 8 years ago
- Python library providing sentiment lexicons.☆26Updated 8 years ago