Distributed Streaming Quantiles (for PySpark)
☆38Jan 30, 2014Updated 12 years ago
Alternatives and similar repositories for dsq
Users that are interested in dsq are comparing it to the libraries listed below.
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Jun 1, 2015Updated 10 years ago
- Data science repo to help others☆12Feb 10, 2016Updated 10 years ago
- Omnivore Optimizer and Distributed CcT☆13Jun 17, 2016Updated 9 years ago
- A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows.☆54Sep 18, 2015Updated 10 years ago
- An example project for doing grid search in MLlib☆13Nov 27, 2014Updated 11 years ago
- Subject-oriented component library for JavaScript.☆21Jul 2, 2014Updated 11 years ago
- Toy Scala/Akka BitTorrent client. Written while learning Scala and now unmaintained.☆77Mar 13, 2015Updated 11 years ago
- They only live to get radical.☆13Nov 29, 2018Updated 7 years ago
- Scala/Akka wrapper for Oanda REST and Stream API☆14Feb 28, 2017Updated 9 years ago
- ScalaIO 2014 Workshop☆25Oct 23, 2014Updated 11 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Jul 17, 2015Updated 10 years ago
- BerkeleyX: CS100.1x, Introduction to Big Data with Apache Spark☆11Jul 27, 2015Updated 10 years ago
- A program that generates a cartoon and a caption using SVG paths and Markov chains☆10Nov 28, 2016Updated 9 years ago
- Haskell implementation of HyperLogLog++ & MinHash for efficient cardinality and intersection estimation☆12Aug 1, 2016Updated 9 years ago
- Uses parselets and rwget to generate csv files from websites☆47Oct 16, 2009Updated 16 years ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆35Dec 5, 2023Updated 2 years ago
- ADMM on Apache Spark☆31Jul 21, 2015Updated 10 years ago
- Translation of the QuickCheck properties in the paper "How to specify it!" by John Hughes into clojure test.check☆10Jul 19, 2019Updated 6 years ago
- Bucketing and partitioning system for Parquet☆30May 22, 2018Updated 7 years ago
- 🥩Using Proof of Stake to secure Proofs of Steak☆14Jan 12, 2023Updated 3 years ago
- CS 294: Deep Reinforcement Learning, Spring 2017 Berkeley☆11Feb 19, 2017Updated 9 years ago
- 🌦️ Domain Ranker☆16Sep 7, 2019Updated 6 years ago
- ☆11Feb 23, 2024Updated 2 years ago
- Simple riemann query tool written in Go.☆21Dec 2, 2016Updated 9 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Jan 27, 2024Updated 2 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Oct 14, 2019Updated 6 years ago
- ☆11Dec 26, 2022Updated 3 years ago
- Code and architecture diagrams for performance testing a few API approaches on AWS☆10Apr 20, 2019Updated 6 years ago
- Prescriptive Applications over Kite and Hadoop☆12Oct 14, 2015Updated 10 years ago
- Spark Custom Stream Source and Sink☆12Jan 19, 2019Updated 7 years ago
- Jupyter Notebooks to be used with Advanced Analytics Workspace platform☆13Jul 14, 2023Updated 2 years ago
- This is a skeleton of a Scala project with maven to start using Spark☆44May 2, 2015Updated 10 years ago
- ☆48Jul 25, 2024Updated last year
- Blog sources: kept mostly as IPython notebooks that can be immediately converted to blogger posts.☆26Nov 13, 2013Updated 12 years ago
- NLP text recommendation system built in Python using Gensim, spaCy, and Plotly Dash☆15Mar 8, 2018Updated 8 years ago
- Secondary sort and streaming reduce for Apache Spark☆78Jul 3, 2023Updated 2 years ago
- Like jq, but with json pointers☆16Nov 30, 2025Updated 3 months ago
- Script to import youtube-dl metadata to PostgreSQL☆14Aug 13, 2018Updated 7 years ago