Thoughts on things I find interesting.
☆17Dec 19, 2024Updated last year
Alternatives and similar repositories for blog
Users that are interested in blog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Nov 16, 2022Updated 3 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Jul 12, 2019Updated 6 years ago
- ☆35Dec 2, 2016Updated 9 years ago
- An Extensible Data Skipping Framework☆48Jul 15, 2025Updated 8 months ago
- A third party tool to simulate the calculation result of Flink's memory configuration. Valid for Flink-1.10 and Flink-1.11.☆45Oct 10, 2020Updated 5 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Flink native Kubernetes Operator is a java based control plane for running Apache Flink native application on Kubernetes.☆52Jul 15, 2022Updated 3 years ago
- Next-generation Cassandra Conference, September 26, 2017☆12Aug 23, 2018Updated 7 years ago
- HDFS rsync-like utility to replicate data between HDFS clusters☆17Jun 16, 2012Updated 13 years ago
- ☆16Jun 27, 2020Updated 5 years ago
- Keap is a heap data structure presenting stable PriorityQueue and stable Keapsort sorting algorithm☆14Jan 30, 2024Updated 2 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Sep 26, 2017Updated 8 years ago
- ☆11Oct 11, 2022Updated 3 years ago
- ☆10Apr 13, 2020Updated 5 years ago
- 基于袋鼠云提供的开源flinkStreamSQL项目,对其实时sql进行可视化功能开发; 通过tcpip通信,前端页面选择需要连接的数据库信息,并写sql语句,点击提交后,后端自动执行集群启动和JobGraph提交,并返回结果给前端页面。实现了使用者即使不了解Kafka、fl…☆11Jun 23, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆20Mar 9, 2026Updated 2 weeks ago
- A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML☆15Dec 24, 2016Updated 9 years ago
- Massively Scalable Anomaly Detection with Apache Kafka, Cassandra and Kubernetes - final code for Instaclustr's Anomalia Machina Blog ser…☆15May 22, 2019Updated 6 years ago
- ☆14Aug 23, 2015Updated 10 years ago
- Presto Gateway routes query based on policy.☆12Sep 15, 2020Updated 5 years ago
- Basic Spark utilities☆13Feb 20, 2025Updated last year
- A pyspark lib to validate data quality☆18Nov 11, 2022Updated 3 years ago
- 提供了solr到elasticsearch的语法翻译引擎,兼容现有的solr语法,提供了基于注解的ORM实现☆12Oct 8, 2015Updated 10 years ago
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆16Oct 3, 2025Updated 5 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Example setup of Flink cluster on Kubernetes with service discovery on Prometheus.☆16Nov 30, 2019Updated 6 years ago
- ☆11Jul 18, 2021Updated 4 years ago
- 录制Spak视频课程讲解涉及编写的源代码 https://edu.hellobi.com/course/107/overview☆13Apr 23, 2019Updated 6 years ago
- Due to lack of resources on how to deploy kafka with simple SASL authentication (just username and password) and how to write producer an…☆12Dec 29, 2021Updated 4 years ago
- It is a kind of big data computing platform which is driven by the Flink SQL. In particular, it provides the SQL programming.☆21Jan 5, 2023Updated 3 years ago
- 面向单机与分布式 OLTP/OLAP 场景的可暂停的渐进式 SQL 引擎 (只用于研究)☆12May 11, 2023Updated 2 years ago
- Example to create lineage in Atlas with sqoop and spark☆14Apr 5, 2017Updated 8 years ago
- Tools for faster and optimized interaction with Teradata and large datasets.☆17Jul 11, 2018Updated 7 years ago
- ACL Management for Apache Spark SQL with Apache Ranger☆17Jun 18, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- OCRA: Object-store Cache in Rust for All☆16Sep 29, 2025Updated 5 months ago
- Rocksdb state storage implementation for Structured Streaming.☆17Oct 21, 2020Updated 5 years ago
- A HBase datasource implementation for Spark and [MLSQL](http://www.mlsql.tech).☆15Sep 29, 2023Updated 2 years ago
- ☆491Oct 21, 2022Updated 3 years ago
- Run MNIST inference in Apache Flink☆11Nov 18, 2020Updated 5 years ago
- Spark Structured Streaming JDBC Sink☆16Apr 26, 2021Updated 4 years ago
- Client libraries of end users of Apache Kyuubi☆11Jan 10, 2023Updated 3 years ago