laserson/dsq

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/laserson/dsq)

laserson / dsq

Distributed Streaming Quantiles (for PySpark)

☆38

Alternatives and similar repositories for dsq

Users that are interested in dsq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DataDog / brod
View on GitHub
## Auto-archived due to inactivity. ## An unmaintained python client to Kafka 0.6
☆32Apr 8, 2023Updated 3 years ago
kiranvodrahalli / cos521
View on GitHub
Final project for COS 521: Using Hokusai algorithm to approximate frequency counts of hashtags in twitter data stream.
☆12Jan 13, 2015Updated 11 years ago
avibryant / simmer
View on GitHub
Reduce your data. A unix filter for algebird-powered aggregation.
☆141Apr 17, 2017Updated 9 years ago
daithiocrualaoich / spark-emr
View on GitHub
Spark Elastic MapReduce bootstrap and runnable examples.
☆17Jun 26, 2013Updated 13 years ago
stripe-archive / herringbone
View on GitHub
Tools for working with parquet, impala, and hive
☆135Jan 4, 2021Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ogrisel / spylearn
View on GitHub
Repo for experiments on pyspark and sklearn
☆79Feb 19, 2014Updated 12 years ago
witgo / spark
View on GitHub
Mirror of Apache Spark
☆11Apr 30, 2026Updated 2 months ago
jatrost / hadoop-binary-analysis
View on GitHub
Framework that makes processing arbitrary binary data in Hadoop easier
☆22Apr 8, 2013Updated 13 years ago
big-data-research / in-memory-data-pipeline
View on GitHub
The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.
☆10Jun 1, 2015Updated 11 years ago
OpenSOC / opensoc.github.io
View on GitHub
☆18Sep 30, 2014Updated 11 years ago
jwills / exhibit
View on GitHub
A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows.
☆54Sep 18, 2015Updated 10 years ago
seiflotfy / s-bitmap
View on GitHub
S-Bitmap: Distinct Counting with a Self-Learning Bitmap
☆37Nov 1, 2015Updated 10 years ago
traintracks / sparkstreaming-algebird-algorithms-demo
View on GitHub
☆18Sep 7, 2014Updated 11 years ago
sematext / jmxc
View on GitHub
Simple JMX Console
☆17Dec 8, 2012Updated 13 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
rclayton / StringSimilarity
View on GitHub
A number of algorithms for calculating string similarity in Java
☆15Jan 23, 2011Updated 15 years ago
organisciak / htrc-book-models
View on GitHub
Within-book topic modeling on HTRC feature extraction files
☆24May 3, 2016Updated 10 years ago
ogrodnek / spark-plug
View on GitHub
scala driver for launching Amazon EMR jobs
☆40Feb 10, 2016Updated 10 years ago
zfz / spark-cs190.1x
View on GitHub
Assignments of CS190.1x, Scalable Machine Learning
☆18Aug 2, 2015Updated 10 years ago
ekarlso / rust-metrics
View on GitHub
☆14Apr 7, 2016Updated 10 years ago
cdapio / coopr
View on GitHub
A template-based cluster provisioning system
☆62Mar 4, 2023Updated 3 years ago
sdhu / elasticsearch-prediction-spark
View on GitHub
Generates Elasticsearch plugin to score/evaluate Spark Trained Models
☆10Apr 25, 2015Updated 11 years ago
trovit / hdfstree
View on GitHub
A command line tool to display HDFS directories as a tree.
☆16Sep 3, 2013Updated 12 years ago
solanolabs / begot
View on GitHub
Go dependency management
☆11May 6, 2016Updated 10 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
ezbz / jmxtrans-lib
View on GitHub
JMXTrans configuration for hadoop/cassandra/zookeeper
☆31Dec 3, 2015Updated 10 years ago
erikson84 / BayesDataAnalysisWithPyMC
View on GitHub
Python (PyMC) adaptation of the R code from "Doing Bayesian Data Analysis"
☆65Apr 18, 2017Updated 9 years ago
craigsc / fig.js
View on GitHub
The dead simple way to integrate a user feedback system into any web application.
☆14Jan 3, 2017Updated 9 years ago
ceteri / exelixi
View on GitHub
Exelixi is a distributed framework based on Apache Mesos, mostly implemented in Python using gevent for high-performance concurrency. It …
☆130Jan 17, 2014Updated 12 years ago
tresata / ganitha
View on GitHub
scalding powered machine learning
☆109Nov 18, 2014Updated 11 years ago
emberlight / sible
View on GitHub
Simple Bluetooth Low Energy Framework for iOS
☆12Feb 17, 2016Updated 10 years ago
uswitch / syslogger
View on GitHub
Forwards syslog messages to Kafka
☆16Oct 19, 2015Updated 10 years ago
bgnkim / ScalaNetwork
View on GitHub
A Neural network implementation with Scala
☆20Jul 17, 2016Updated 10 years ago
metamx / rad-tech-datatypes
View on GitHub
They only live to get radical.
☆13Nov 29, 2018Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ept / cap-critique
View on GitHub
Source of paper “A critique of the CAP theorem”
☆16Dec 14, 2015Updated 10 years ago
lastfm / python-mirbuild
View on GitHub
The Last.fm MIR meta-build-system
☆25Oct 14, 2014Updated 11 years ago
jesperborgstrup / Py-IBLT
View on GitHub
A Python implementation of Invertible Bloom Lookup Tables
☆81Oct 23, 2014Updated 11 years ago
aws-samples / aws-ingesting-click-logs-using-terraform
View on GitHub
Provision AWS infrastructure using Terraform (By HashiCorp): an example of web application logging customer data
☆12Dec 19, 2025Updated 7 months ago
databricks / spark-tfocs
View on GitHub
A Spark port of TFOCS: Templates for First-Order Conic Solvers (cvxr.com/tfocs)
☆90Apr 15, 2024Updated 2 years ago
coolwanglu / quantile-alg
View on GitHub
Algorithms for finding quantiles of a data stream
☆20Oct 17, 2013Updated 12 years ago
ContinuumIO / cdx
View on GitHub
☆27Jul 31, 2023Updated 2 years ago