This project has customization likes custom data sources, plugins written for the distributed systems like Apache Spark, Apache Ignite etc
☆34Oct 6, 2023Updated 2 years ago
Alternatives and similar repositories for big-data-projects
Users that are interested in big-data-projects are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Mar 19, 2024Updated 2 years ago
- FeatInsight is a feature platform based on OpenMLDB☆22Mar 7, 2025Updated last year
- Spark Library for Bulk Loading into Cassandra☆12Apr 18, 2018Updated 8 years ago
- Distributed lock backed by Dynamodb☆11Dec 7, 2023Updated 2 years ago
- A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.☆13Oct 27, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Simple Go 1.8 plugin test for https://jeremywho.com/go-1.8---plugins/☆10Feb 28, 2017Updated 9 years ago
- grpc-connection-library that supports the gRPC client-server connection interface for the developers to use as a gRPC middleware in the a…☆14Aug 19, 2021Updated 4 years ago
- A reasonably complete and well-tested golang port of httpbin, with zero dependencies outside the go stdlib.☆11Nov 24, 2025Updated 7 months ago
- A library for creating and patching binary diffs. Based on bsdiff.☆11Nov 23, 2014Updated 11 years ago
- ☆10Oct 26, 2016Updated 9 years ago
- C 结构体与 JSON 快速互转库☆10Nov 27, 2017Updated 8 years ago
- Repository template for katas in Go and VS Code.☆11Jun 22, 2026Updated last week
- Task Metrics Explorer☆14Apr 2, 2019Updated 7 years ago
- A complete golang implementation of Common industrial protocol☆11Dec 26, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Mar 9, 2021Updated 5 years ago
- Provide functionality to build statistical models to repair dirty tabular data in Spark☆12Apr 21, 2023Updated 3 years ago
- Custom datasource about spark structure streaming☆12Jan 29, 2019Updated 7 years ago
- A MVP implementation of distributed query engine cut from datafusion-ballista codebase for learning purpose.☆12Jan 10, 2025Updated last year
- Samples for jetbrick-template-2x☆10Mar 17, 2017Updated 9 years ago
- A starter-kit to build restful api's with the awesome golang☆16Sep 26, 2017Updated 8 years ago
- This application comes as Spark2.1-as-Service-Provider using an embedded, Reactive-Streams-based, fully asynchronous HTTP server☆50Jul 16, 2023Updated 2 years ago
- A custom sink provider for Apache Spark that sends the content of a dataframe to an AWS SQS☆23Feb 19, 2026Updated 4 months ago
- “Generate to Understand for Representation”☆14Apr 18, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Zookeeper management project under the control of simple rights(简单权限控制下的zookeeper管理项目)☆12Jun 25, 2018Updated 8 years ago
- Filling in the Spark function gaps across APIs☆50Apr 14, 2021Updated 5 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Dec 31, 2024Updated last year
- Scala client for MaxMind Geo-IP☆87Feb 18, 2026Updated 4 months ago
- TiKV Client for C++☆15Nov 27, 2023Updated 2 years ago
- A sql extension build on spark3 datasource v2 api, ex: hive v2 catalog support amoung multi clusters☆11May 7, 2022Updated 4 years ago
- A starter kit for public apps that utilize the HubSpot API☆11Jul 19, 2023Updated 2 years ago
- ☆10Dec 16, 2022Updated 3 years ago
- An experiment to inject a customized parser using SparkSessionExtension☆16Jan 1, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Go RESTful API server with gin and docker☆18Jul 18, 2019Updated 6 years ago
- A network traffic relay☆23Mar 14, 2025Updated last year
- ☆11Sep 13, 2020Updated 5 years ago
- Java client for Hawkular☆11Mar 16, 2017Updated 9 years ago
- A minimal buildpack for Pipenv.☆11Feb 13, 2019Updated 7 years ago
- This repo is my settings for using the local LLM with graphrag & an UI to chat with the index result☆16Jul 24, 2024Updated last year
- Presto and Minio on Docker Infrastructure☆43Jul 11, 2018Updated 7 years ago