ali-ce / datasetsLinks
My datasets - Original data or Aggregated / cleaned / restructured existing datasets. Released here under Creative Commons B
☆202Updated 8 years ago
Alternatives and similar repositories for datasets
Users that are interested in datasets are comparing it to the libraries listed below
Sorting:
- A collection of tools to collect and download various data.☆210Updated 8 years ago
- Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily☆111Updated 10 years ago
- Data journalism and easy to replicate notebooks using Python, R, and Web visualisations☆90Updated 7 years ago
- Download data from IMDB movies and parse into useful form☆206Updated 6 years ago
- Predict age and gender from a first name☆59Updated 7 years ago
- Analysing Weed Pricing across US - Data Analysis Workshop☆130Updated 3 months ago
- A wrapper around tweepy to produce pandas dataframes for analysis☆75Updated 9 years ago
- 538 Election Forecasting Model☆305Updated 9 years ago
- Code for Pythonic visualization blog post☆40Updated 8 years ago
- Python 3.x notebooks about real-world data cleaning and visualization☆72Updated 9 years ago
- Jupyter notebook + Code for processing Facebook Reactions data and making Interactive Charts☆38Updated 9 years ago
- ☆46Updated 4 months ago
- A simple dataset of Stack Overflow questions and tags☆109Updated 8 years ago
- Curated list of all dataset websites that I find☆83Updated 7 years ago
- Journal of Statistical Education Paper on Using OkCupid Data for Data Science Courses☆236Updated 4 years ago
- Code examples and data for the KiwiPyCon 2014 NLP tutorial☆39Updated 11 years ago
- A dataset of the battles in the War of the Five Kings from George R.R. Martin's A Song Of Ice And Fire series.☆135Updated 4 years ago
- For the pandas tutorial at PyData Seattle: https://www.youtube.com/watch?v=otCriSKVV_8☆116Updated 4 years ago
- Python API for Glassdoor.com☆81Updated 9 years ago
- Scraping Tweet data for Russian Troll Twitter accounts into Neo4j☆57Updated 7 years ago
- Material for some talks I have given☆62Updated last year
- Generating the next read for our book club- with Data Science!☆39Updated 9 years ago
- A bot tweeting nonsensical craft beer reviews via Markov chains☆49Updated 9 years ago
- A Python script that parses post titles, self-texts, and comments on reddit and makes word clouds out of the word frequencies.☆293Updated 2 years ago
- Analysis of the Twitter Social graph using Python, NetworkX, and D3.js☆60Updated 13 years ago
- Code to transform Hillary's emails from raw PDF documents to a SQLite database☆161Updated 10 years ago
- Rudimentary Bayesian Beta-Bernoulli A/B testing inference and visualization code.☆64Updated 11 years ago
- Sample repo for luigi tasks & config☆36Updated 9 years ago
- A collection of public data sets☆517Updated 10 months ago
- materials for General Assembly Data Science DC course☆81Updated 10 years ago