A collection of the best open data sets and open-source tools for data science
☆1,141 · Aug 29, 2016 · Updated 9 years ago
Alternatives and similar repositories for dstk
Users that are interested in dstk are comparing it to the libraries listed below
- R wrapper for the Data Science Toolkit ☆26 · Nov 20, 2017 · Updated 8 years ago
- Modular Street Address Geocoder ☆397 · Aug 15, 2012 · Updated 13 years ago
- A simple Python library/tool for pulling location information from unstructured text ☆186 · Dec 28, 2010 · Updated 15 years ago
- Cube: A system for time series visualization. ☆3,886 · Apr 5, 2019 · Updated 6 years ago
- Unofficial API of Runkeeper ☆22 · Apr 1, 2015 · Updated 10 years ago
- A ready-to-deploy system for aggregating regional boundary data (from shapefiles) and republishing that data via a RESTful JSON API. ☆82 · Apr 7, 2022 · Updated 3 years ago
- Searching for an honest classifier ☆17 · Jan 14, 2016 · Updated 10 years ago
- Python client library for controlling Google Refine ☆83 · Jun 20, 2017 · Updated 8 years ago
- R package 2013 google trend ☆15 · Jan 5, 2015 · Updated 11 years ago
- Exception catcher that runs on Google App Engine ☆74 · Jun 21, 2013 · Updated 12 years ago
- A set of convenience functions in R for exploring iPhone and iPad location data ☆37 · Apr 25, 2011 · Updated 14 years ago
- A uniform interface for domain data (deprecated) ☆653 · Aug 24, 2015 · Updated 10 years ago
- TileMill is a modern map design studio ☆3,153 · Nov 5, 2023 · Updated 2 years ago
- Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more ☆8,790 · Aug 16, 2017 · Updated 8 years ago
- Social sentiment flagger intended to judge given text as: positive, neutral or negative. ☆131 · Jul 18, 2012 · Updated 13 years ago
- GoldenOrb is an open-source implementation of Pregel, Google's graph processing framework ☆294 · Jun 29, 2022 · Updated 3 years ago
- Helpful bits to make caching and cache invalidation as painless as possible in Django! ☆38 · Sep 3, 2011 · Updated 14 years ago
- a python port of https://github.com/twitter/twitter-text-rb also available via `pip install twitter_text` ☆82 · Nov 8, 2017 · Updated 8 years ago
- style geojson based on data classifications ☆114 · Mar 7, 2019 · Updated 7 years ago
- PANDA: A Newsroom Data Appliance ☆208 · Jul 6, 2022 · Updated 3 years ago
- ☆26 · Oct 18, 2016 · Updated 9 years ago
- An object RESTational model ☆195 · Jul 9, 2024 · Updated last year
- Example application using neo4j.rb ☆45 · Jan 29, 2013 · Updated 13 years ago
- RaphaelJS Plugin to serialize SVG Objects for exporting ☆120 · Nov 17, 2011 · Updated 14 years ago
- An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them. ☆4,111 · Sep 20, 2021 · Updated 4 years ago
- GIS data for the U.S.-Mexico border fence (perhaps a wall in the future) ☆29 · Jul 3, 2017 · Updated 8 years ago
- A template I use to quickly set up Node.js-backed web apps on Amazon EC2 ☆323 · Feb 28, 2016 · Updated 10 years ago
- ☆1,056 · Jul 12, 2017 · Updated 8 years ago
- Open-source Dropbox using Ruby and Git ☆1,134 · Apr 14, 2017 · Updated 8 years ago
- R package: funModeling: data cleaning, importance variable analysis and model performance ☆100 · Feb 17, 2026 · Updated last month
- Convert JSON to a UNIX-friendly line-based format. ☆301 · Jan 10, 2021 · Updated 5 years ago
- simple python datastructure wrappings for redis ☆105 · Jun 26, 2021 · Updated 4 years ago
- A web renderer for geographic heat maps, using OpenStreetMap compatible file formats ☆103 · May 22, 2023 · Updated 2 years ago
- A quick and dirty command line tool for bulk uploading documents to DocumentCloud. ☆22 · Mar 17, 2011 · Updated 15 years ago
- Construct polygons from tagged points ☆43 · Feb 11, 2013 · Updated 13 years ago
- an rdflib plugin to parse html5 microdata ☆53 · Nov 3, 2011 · Updated 14 years ago
- JavaScript code rewriter for taming async-callback-style code ☆806 · Dec 1, 2021 · Updated 4 years ago
- Towards open digital publishing ☆926 · Aug 19, 2015 · Updated 10 years ago
- Corresponding Code for my Talk on 7/30 @ PyOhio ☆28 · Jul 30, 2011 · Updated 14 years ago