Data Sketches for Apache Spark
☆22 · Dec 22, 2022 · Updated 3 years ago
Alternatives and similar repositories for datasketches-spark
Users that are interested in datasketches-spark are comparing it to the libraries listed below.
- An efficient C hash-table like data structure with static size that evicts LRU object on insertion ☆11 · Sep 10, 2023 · Updated 2 years ago
- This repo stores my Spark Tutorial slides. ☆15 · Feb 8, 2016 · Updated 10 years ago
- Amundsen Gremlin ☆22 · Aug 26, 2022 · Updated 3 years ago
- Algebird's HyperLogLog support for Apache Spark. ☆10 · Jul 20, 2017 · Updated 8 years ago
- A library for writing chemical and biological data management systems ☆10 · Oct 24, 2019 · Updated 6 years ago
- High performance Privacy By Design using Matryoshka and Spark talk code ☆13 · May 21, 2019 · Updated 6 years ago
- Kexplain is an interactive kubectl explain ☆12 · Oct 23, 2023 · Updated 2 years ago
- Deriving Spark DataFrame schemas from case classes ☆44 · Jun 24, 2024 · Updated last year
- Example of how to set SBT up for local development of AWS Glue Scripts ☆16 · Jan 4, 2021 · Updated 5 years ago
- RAPIDS Accelerator JNI For Apache Spark ☆56 · Updated this week
- native Go library for Delta Lake ☆10 · Jul 31, 2022 · Updated 3 years ago
- Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps ☆13 · Sep 30, 2021 · Updated 4 years ago
- Some Avro operations in Scala ☆10 · Mar 19, 2026 · Updated last month
- A brief presentation comparing Scala with Kotlin aimed toward Scala FP devs at 47 Degrees ☆40 · Dec 23, 2019 · Updated 6 years ago
- ☆14 · May 23, 2017 · Updated 8 years ago
- A SQL extension built on the Spark 3 DataSource V2 API, e.g. Hive V2 catalog support among multiple clusters ☆12 · May 7, 2022 · Updated 3 years ago
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR ☆19 · Jan 20, 2026 · Updated 2 months ago
- Trying to code clean and delegate code to small functions as much as possible ☆25 · May 14, 2024 · Updated last year
- A discrete, colored Petri Net DSL and executor ☆17 · Aug 9, 2022 · Updated 3 years ago
- a repo for how to set up xubuntu like me ☆29 · Aug 5, 2023 · Updated 2 years ago
- User tools for Spark RAPIDS ☆70 · Apr 10, 2026 · Updated last week
- Notebook Discovery Tool for Databricks notebooks ☆19 · Jul 14, 2022 · Updated 3 years ago
- An Erlang ingester for GreptimeDB, which is compatible with GreptimeDB protocol and lightweight. ☆16 · Mar 4, 2026 · Updated last month
- Fast RFC 3339 (ISO 8601) timestamp parser and formatter implemented in C, zero dependencies. ☆38 · Feb 13, 2014 · Updated 12 years ago
- Dione - a Spark and HDFS indexing library ☆53 · Mar 26, 2026 · Updated 3 weeks ago
- Golang JSON Specification API Inspired By JSH ☆15 · Jul 19, 2016 · Updated 9 years ago
- Plan B Token Info service for JWT tokens ☆17 · May 17, 2017 · Updated 8 years ago
- DDSketch: A Fast and Fully-Mergeable Quantile Sketch with Relative-Error Guarantees. ☆129 · Mar 26, 2026 · Updated 3 weeks ago
- SHAPE/S∀F∃: static prover/type-checker for N-D array programming in Scala, a use case of intuitionistic type theory ☆32 · Sep 9, 2025 · Updated 7 months ago
- PostgreSQL extension providing approximate algorithms based on apache/datasketches-cpp ☆91 · Jul 2, 2025 · Updated 9 months ago
- An online book. ☆11 · Jan 24, 2015 · Updated 11 years ago
- Client libraries of end users of Apache Kyuubi ☆11 · Jan 10, 2023 · Updated 3 years ago
- SCARFF (SCAlable Real-time Frauds Finder) is a framework which enables credit card fraud detection. ☆19 · Feb 8, 2017 · Updated 9 years ago
- A Rust ingester for GreptimeDB, which is compatible with GreptimeDB protocol and lightweight. ☆24 · Apr 8, 2026 · Updated last week
- A library for Amazon Neptune that enables AWS Signature Version 4 signing for HTTP using Netty. ☆17 · Oct 21, 2025 · Updated 5 months ago
- ☆12 · Mar 12, 2021 · Updated 5 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆16 · Jan 4, 2026 · Updated 3 months ago
- Core C++ Sketch Library ☆257 · Mar 28, 2026 · Updated 3 weeks ago
- A fast user agent string parser for Go. ☆60 · Jun 1, 2017 · Updated 8 years ago