Self-written notes that may be useful
☆107Dec 26, 2015Updated 10 years ago
Alternatives and similar repositories for MyNotes
Users who are interested in MyNotes are comparing it to the repositories listed below
- A Spark Reliability Testing Suite☆13Jan 10, 2017Updated 9 years ago
- GIS extension for SparkSQL☆39Jan 25, 2016Updated 10 years ago
- ServiceFramework example project☆10Apr 2, 2016Updated 9 years ago
- Interactive shell for sqlite-utils using litecli☆14Jul 27, 2023Updated 2 years ago
- A storage format 5~100 times faster than Spark-Parquet☆12Feb 22, 2016Updated 10 years ago
- My blogs☆47Apr 13, 2016Updated 9 years ago
- ☆12Oct 13, 2016Updated 9 years ago
- Datasets and notebooks☆13Oct 26, 2016Updated 9 years ago
- Source code of Blog at☆51Sep 17, 2025Updated 5 months ago
- PubNative Dockerfiles library☆16Jan 15, 2026Updated last month
- CSV file loader for HBase and Cassandra☆17May 12, 2021Updated 4 years ago
- Joins for skewed datasets in Spark☆57Aug 18, 2017Updated 8 years ago
- A push-based java stream/reactive library☆13Mar 24, 2016Updated 9 years ago
- Python library to make it easy to upsert on MySQL, PostgreSQL, and SQLite3.☆18Apr 14, 2023Updated 2 years ago
- ☆17Jan 25, 2017Updated 9 years ago
- Study guide for Microsoft Azure Exam AZ-300☆18Apr 23, 2019Updated 6 years ago
- Notes talking about the design and implementation of Apache Spark☆5,360Apr 2, 2024Updated last year
- First-cohort students of 极客班 (GeekBand): submit your C++ homework here, in a folder named after your student number☆12Oct 26, 2015Updated 10 years ago
- MySQL to NoSQL real time dataflow☆19Oct 14, 2017Updated 8 years ago
- Learning to write Spark examples☆161Aug 20, 2014Updated 11 years ago
- Profiling Spark Applications for Performance Comparison and Diagnosis☆17Nov 11, 2018Updated 7 years ago
- Pytest plugin that runs PyStack on slow or hanging tests.☆20Nov 6, 2025Updated 3 months ago
- (Under Development) Extract features from text and links. Useful for machine learning algorithms.☆23Nov 22, 2022Updated 3 years ago
- spark structured streaming via HTTP communication☆18Jul 7, 2022Updated 3 years ago
- DEPRECATED! Use https://github.com/h2oai/sparkling-water repository! H2O and Spark interoperability based on Tachyon.☆44Nov 25, 2014Updated 11 years ago
- Combine Flume, Spark Streaming, and Redis for real-time computing☆22Oct 20, 2014Updated 11 years ago
- [DEPRECATED] For read-only reference of the ALOJA Big Data Benchmarking platform: includes tools to define and deploy clusters, orchestr…☆23Feb 17, 2021Updated 5 years ago
- A K8s-based infrastructure for analytics☆24Jan 15, 2020Updated 6 years ago
- Secondary sort and streaming reduce for Apache Spark☆78Jul 3, 2023Updated 2 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase☆50Oct 31, 2014Updated 11 years ago
- Pig on Apache Spark☆82Mar 23, 2015Updated 10 years ago
- Common metadata layer for Hadoop's Map Reduce, Pig, and Hive☆76Feb 17, 2011Updated 15 years ago
- REST job server for Apache Spark☆2,842Jul 8, 2025Updated 7 months ago
- MLlib Convolutional and Feedforward Neural Network implementation with a high level API and advanced optimizers.☆27Aug 30, 2017Updated 8 years ago
- Factorization Machines on Spark and Glint☆25Nov 7, 2016Updated 9 years ago
- Cantor provides utilities for estimating the cardinality of large sets.☆84Apr 12, 2022Updated 3 years ago
- Dumping ground for random stuff☆55Jun 14, 2025Updated 8 months ago
- Digging holes and filling them in☆687Aug 18, 2016Updated 9 years ago
- An evolutionary algorithm-based optimization for tracking weights in the OpenSim Residual Reduction Algorithm (RRA).☆11Jul 17, 2023Updated 2 years ago