This project has customization likes custom data sources, plugins written for the distributed systems like Apache Spark, Apache Ignite etc
☆34Oct 6, 2023Updated 2 years ago
Alternatives and similar repositories for big-data-projects
Users that are interested in big-data-projects are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Mar 19, 2024Updated 2 years ago
- Render Excel templates using a database and a specification file☆13Nov 24, 2018Updated 7 years ago
- A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.☆13Oct 27, 2021Updated 4 years ago
- EMR Hudi Workshop content☆12Dec 10, 2021Updated 4 years ago
- ☆12Aug 26, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- a tailored Apache Calcite for Apache Kylin, more details at http://mail-archives.apache.org/mod_mbox/kylin-dev/201704.mbox/%3CCAF7etT=wEB…☆14Nov 7, 2025Updated 5 months ago
- smbus provides access to the System Management bus over I2C☆15Dec 16, 2020Updated 5 years ago
- A reasonably complete and well-tested golang port of httpbin, with zero dependencies outside the go stdlib.☆11Nov 24, 2025Updated 4 months ago
- A library for creating and patching binary diffs. Based on bsdiff.☆11Nov 23, 2014Updated 11 years ago
- ☆10Oct 26, 2016Updated 9 years ago
- Example express server demonstrating private embed with programmatic filtering☆18Sep 18, 2025Updated 7 months ago
- Task Metrics Explorer☆14Apr 2, 2019Updated 7 years ago
- GitHub Action to lint with scalafmt☆12Jul 13, 2022Updated 3 years ago
- Word Sense Disambiguation using Word Specific models, All word models and Hierarchical models in Tensorflow☆33Jun 12, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Mar 9, 2021Updated 5 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 2 years ago
- MyBatis接入大众点评CAT监控平台☆11Aug 5, 2018Updated 7 years ago
- Custom datasource about spark structure streaming☆12Jan 29, 2019Updated 7 years ago
- A MVP implementation of distributed query engine cut from datafusion-ballista codebase for learning purpose.☆12Jan 10, 2025Updated last year
- Solr for Astrophysics Data System☆55Feb 10, 2026Updated 2 months ago
- A free open unofficial stacktrace translator for Zelix KlassMaster☆10Apr 10, 2019Updated 7 years ago
- Samples for jetbrick-template-2x☆11Mar 17, 2017Updated 9 years ago
- 自己学习用☆11Sep 23, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This application comes as Spark2.1-as-Service-Provider using an embedded, Reactive-Streams-based, fully asynchronous HTTP server☆50Jul 16, 2023Updated 2 years ago
- “Generate to Understand for Representation”☆14Apr 18, 2024Updated 2 years ago
- Build beautiful charts using Domo's powerful charting engine☆22Nov 15, 2024Updated last year
- Filling in the Spark function gaps across APIs☆50Apr 14, 2021Updated 5 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Dec 31, 2024Updated last year
- Another A/B test library☆25Mar 19, 2026Updated last month
- ☆15May 19, 2019Updated 6 years ago
- A sql extension build on spark3 datasource v2 api, ex: hive v2 catalog support amoung multi clusters☆12May 7, 2022Updated 3 years ago
- A starter kit for public apps that utilize the HubSpot API☆11Jul 19, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆10Dec 16, 2022Updated 3 years ago
- An experiment to inject a customized parser using SparkSessionExtension☆16Jan 1, 2018Updated 8 years ago
- Java client for Hawkular☆11Mar 16, 2017Updated 9 years ago
- xxhash-64 in 20 lines☆25Jun 29, 2024Updated last year
- A minimal buildpack for Pipenv.☆11Feb 13, 2019Updated 7 years ago
- tikv-importer is a front-end to help ingesting large number of KV pairs into a TiKV cluster☆20Mar 20, 2023Updated 3 years ago
- ☆40Mar 10, 2026Updated last month