DataHackIL / DataSets
A curated list of cool open datasets and APIs to use in machine learning driven projects.
☆27Updated 6 years ago
Alternatives and similar repositories for DataSets:
Users that are interested in DataSets are comparing it to the libraries listed below
- DataHack Challenges - Challenges offered during our hackathon by top data companies.☆11Updated 5 years ago
- Relatively simple text classification powered by spaCy☆41Updated 9 years ago
- ☆13Updated 4 years ago
- Social Media Analysis for Situation Awareness during Crises (SMASAC) Tutorial☆25Updated 6 years ago
- A text similarity computation using minhashing and Jaccard distance on reuters dataset☆16Updated 6 years ago
- Resources for the Data Mining for Bussiness and Governance course.☆54Updated 4 years ago
- Material for UW Extension Data Science 350☆19Updated 7 years ago
- ☆33Updated last year
- Experimental library for sampling and validating scikit-learn parameters☆10Updated 6 years ago
- Quill's library of open source NLP algorithms and data sets.☆52Updated last year
- A traits predictor using Python☆14Updated 6 years ago
- ☆21Updated 5 years ago
- sciblox - Easier Data Science and Machine Learning☆50Updated 7 years ago
- Python library providing sentiment lexicons.☆26Updated 8 years ago
- bamboolib - template for creating your own binder notebook☆21Updated 3 years ago
- LNEx: Location Name Extractor☆24Updated 4 years ago
- Research paper classification using machine learning and NLP☆27Updated 6 years ago
- Text Preprocessing in Python☆19Updated 8 years ago
- ULMFiT Method for German Language☆15Updated 5 years ago
- ☆32Updated 6 years ago
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.…☆82Updated 2 years ago
- Spark NLP for Streamlit☆15Updated 3 years ago
- A selection of business datasets☆18Updated 5 years ago
- Code for the paper "Benchmarking sentiment analysis methods for large-scale texts: A case for using continuum-scored words and word shift…☆16Updated 7 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago
- Using Data Science To Uncover State-backed Disinformation Campaigns On Twitter☆23Updated 4 years ago
- Tutorial repo for the article "ML in Production"☆30Updated 2 years ago
- ☆40Updated 9 years ago
- A visualisation tool for Spacy using Hierplane.☆65Updated 2 years ago
- I am teaching a Learning ML workshop for some folks @ Belong.co. Creating this repo to organise the course material.☆23Updated 6 years ago