Distributed Streaming Quantiles (for PySpark)
☆38Jan 30, 2014Updated 12 years ago
Alternatives and similar repositories for dsq
Users that are interested in dsq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Jun 1, 2015Updated 10 years ago
- Data science repo to help others☆12Feb 10, 2016Updated 10 years ago
- A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows.☆54Sep 18, 2015Updated 10 years ago
- An example project for doing grid search in MLlib☆13Nov 27, 2014Updated 11 years ago
- Learning PySpark video series☆11Mar 5, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Fureteur is a simple, configurable, fault-tolerant web crawler written is Scala☆29Oct 14, 2014Updated 11 years ago
- A Neural network implementation with Scala☆20Jul 17, 2016Updated 9 years ago
- ECS Container Express☆32Dec 12, 2018Updated 7 years ago
- They only live to get radical.☆13Nov 29, 2018Updated 7 years ago
- Online Summarization Algorithm for Twitter Streams - supporting code for an EACL 2014 paper☆16Feb 25, 2014Updated 12 years ago
- Botoflow is an asynchronous framework for Amazon SWF that helps you build SWF applications using Python☆13Dec 26, 2022Updated 3 years ago
- A Python script to swoop and decrypt passwords from Chrome's local storage.☆11Dec 10, 2018Updated 7 years ago
- A script that queries OSM using AWS Athena for buildings in a given bounding box. Demo:☆14Nov 7, 2018Updated 7 years ago
- Python scripts for Agisoft Photoscan☆12Jun 18, 2015Updated 10 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A docker image for Omeka S - does not include either modules or themes, just Omeka itself☆17Mar 27, 2026Updated last month
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Jul 17, 2015Updated 10 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀☆12Feb 28, 2020Updated 6 years ago
- BerkeleyX: CS100.1x, Introduction to Big Data with Apache Spark☆10Jul 27, 2015Updated 10 years ago
- A bash tool (script) to generate animated (gif) temporal progressions of land cover with inputs of lat, long, and start/end dates. Requir…☆17Mar 25, 2015Updated 11 years ago
- FRED simulator and associated paper☆26Jan 15, 2016Updated 10 years ago
- We store attacks and exploits that we've found useful in our research☆13Jun 4, 2015Updated 10 years ago
- a trained attention-based summarization model☆10May 22, 2017Updated 9 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- List of awesome university courses for learning Computer Science!☆15Oct 1, 2018Updated 7 years ago
- Uses parselets and rwget to generate csv files from websites☆47Oct 16, 2009Updated 16 years ago
- Tensorflow implementation of Target-dependent LSTM (Tang et al. 2016)☆10Sep 13, 2017Updated 8 years ago
- Python package for the extraction of speech features for sustained phonation☆12Aug 10, 2020Updated 5 years ago
- Scripts for the Python API in PhotoScan☆17Sep 1, 2015Updated 10 years ago
- ADMM on Apache Spark☆31Jul 21, 2015Updated 10 years ago
- Notes and tasks code for Cloudera / Udacity hadoop course☆16Jul 31, 2015Updated 10 years ago
- Translation of the QuickCheck properties in the paper "How to specify it!" by John Hughes into clojure test.check☆10Jul 19, 2019Updated 6 years ago
- Bucketing and partitioning system for Parquet☆30May 22, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 🥩Using Proof of Stake to secure Proofs of Steak☆14Jan 12, 2023Updated 3 years ago
- Experiments with WebGL + proj4☆16Apr 16, 2013Updated 13 years ago
- CS 294: Deep Reinforcement Learning, Spring 2017 Berkeley☆11Feb 19, 2017Updated 9 years ago
- Simple riemann query tool written in Go.☆21Dec 2, 2016Updated 9 years ago
- Pig on Apache Spark☆82Mar 23, 2015Updated 11 years ago
- A higher-level module for creating content models using levelup as db.☆26Sep 15, 2018Updated 7 years ago
- Code and architecture diagrams for performance testing a few API approaches on AWS☆10Apr 20, 2019Updated 7 years ago