Distributed Streaming Quantiles (for PySpark)
☆38Jan 30, 2014Updated 12 years ago
Alternatives and similar repositories for dsq
Users that are interested in dsq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Jun 1, 2015Updated 11 years ago
- Data science repo to help others☆12Feb 10, 2016Updated 10 years ago
- A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows.☆54Sep 18, 2015Updated 10 years ago
- An example project for doing grid search in MLlib☆13Nov 27, 2014Updated 11 years ago
- Learning PySpark video series☆11Mar 5, 2018Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Fureteur is a simple, configurable, fault-tolerant web crawler written is Scala☆29Oct 14, 2014Updated 11 years ago
- Assignments of CS190.1x, Scalable Machine Learning☆18Aug 2, 2015Updated 10 years ago
- Listing my favorite research papers 📝 from different fields as I read them.☆10Oct 17, 2019Updated 6 years ago
- Toy scala/akka Bittorrent client. Written while learning scala and now unmaintained☆77Mar 13, 2015Updated 11 years ago
- They only live to get radical.☆13Nov 29, 2018Updated 7 years ago
- Botoflow is an asynchronous framework for Amazon SWF that helps you build SWF applications using Python☆13Dec 26, 2022Updated 3 years ago
- Scala/Akka wrapper for Oanda REST and Stream API☆14Feb 28, 2017Updated 9 years ago
- Documentation tools for common lisp☆15Sep 19, 2021Updated 4 years ago
- ScalaIO 2014 Workshop☆25Oct 23, 2014Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Jul 17, 2015Updated 10 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀☆12Feb 28, 2020Updated 6 years ago
- FRED simulator and associated paper☆26Jan 15, 2016Updated 10 years ago
- A program that generates a cartoon and a caption using SVG paths and Markov chains☆10Nov 28, 2016Updated 9 years ago
- List of awesome university courses for learning Computer Science!☆15Oct 1, 2018Updated 7 years ago
- Haskell implementation of HyperLogLog++ & MinHash for efficient cardinality and intersection estimation☆13Aug 1, 2016Updated 9 years ago
- Uses parselets and rwget to generate csv files from websites☆47Oct 16, 2009Updated 16 years ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆35Dec 5, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Subscriber Registry API and SIP Authentication Server☆18Jul 13, 2016Updated 9 years ago
- Simple K Nearest Neighbour Algorithm☆37May 22, 2020Updated 6 years ago
- ADMM on Apache Spark☆31Jul 21, 2015Updated 10 years ago
- Translation of the QuickCheck properties in the paper "How to specify it!" by John Hughes into clojure test.check☆10Jul 19, 2019Updated 6 years ago
- Bucketing and partitioning system for Parquet☆30May 22, 2018Updated 8 years ago
- 🥩Using Proof of Stake to secure Proofs of Steak☆14Jan 12, 2023Updated 3 years ago
- CS 294: Deep Reinforcement Learning, Spring 2017 Berkeley☆11Feb 19, 2017Updated 9 years ago
- 🌦️ Domain Ranker☆16Sep 7, 2019Updated 6 years ago
- ☆12Feb 23, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Simple riemann query tool written in Go.☆21Dec 2, 2016Updated 9 years ago
- This repo is archived. Active work continued in fork github.com/pliba/kaminpy☆24Nov 19, 2018Updated 7 years ago
- Pig on Apache Spark☆82Mar 23, 2015Updated 11 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆46Jan 27, 2024Updated 2 years ago
- Prescriptive Applications over Kite and Hadoop☆12Oct 14, 2015Updated 10 years ago
- ElasticSearch Prediction Generator and Plugin☆22Sep 17, 2015Updated 10 years ago
- scikit-learn repo for ogrisel's work in progress contributions☆19Jun 3, 2026Updated last week