aamargajbhiye/big-data-projects

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aamargajbhiye/big-data-projects)

aamargajbhiye / big-data-projects

This project has customization likes custom data sources, plugins written for the distributed systems like Apache Spark, Apache Ignite etc

☆34

Alternatives and similar repositories for big-data-projects

Users that are interested in big-data-projects are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

phatak-dev / spark-3.0-examples
View on GitHub
Examples of Spark 3.0
☆44Nov 11, 2020Updated 5 years ago
benbovy / xlrenderer
View on GitHub
Render Excel templates using a database and a specification file
☆13Nov 24, 2018Updated 7 years ago
vitillo / spark-hyperloglog
View on GitHub
Algebird's HyperLogLog support for Apache Spark.
☆10Jul 20, 2017Updated 9 years ago
4paradigm / FeatInsight
View on GitHub
FeatInsight is a feature platform based on OpenMLDB
☆22Mar 7, 2025Updated last year
jparkie / Spark2Cassandra
View on GitHub
Spark Library for Bulk Loading into Cassandra
☆12Apr 18, 2018Updated 8 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
fabiogouw / OrleansDemo
View on GitHub
Exemplo de uso do Microsoft Orleans para criação de aplicações stateful
☆11Jan 16, 2024Updated 2 years ago
delta-incubator / dynamodb-lock-rs
View on GitHub
Distributed lock backed by Dynamodb
☆11Dec 7, 2023Updated 2 years ago
colbyford / sparkitecture
View on GitHub
A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.
☆13Oct 27, 2021Updated 4 years ago
learosema / ella-math
View on GitHub
Basic Geometry and Linear Algebra library
☆16Feb 14, 2023Updated 3 years ago
nmukerje / EMR-Hudi-Workshop
View on GitHub
EMR Hudi Workshop content
☆12Dec 10, 2021Updated 4 years ago
Factual / beercode-open
View on GitHub
Open-source code backed by the Factual Beer Guarantee
☆17Nov 19, 2015Updated 10 years ago
woodruffw / libbdiff
View on GitHub
A library for creating and patching binary diffs. Based on bsdiff.
☆11Nov 23, 2014Updated 11 years ago
go-daq / smbus
View on GitHub
smbus provides access to the System Management bus over I2C
☆15Dec 16, 2020Updated 5 years ago
slively / loopback-discover-models
View on GitHub
Simple CLI to discover and write model definitions from an existing datasource.
☆21Jul 19, 2017Updated 9 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Sshanu / Hierarchical-Word-Sense-Disambiguation-using-WordNet-Senses
View on GitHub
Word Sense Disambiguation using Word Specific models, All word models and Hierarchical models in Tensorflow
☆33Jun 12, 2020Updated 6 years ago
rabiran / Kartoffel
View on GitHub
☆11Dec 22, 2022Updated 3 years ago
unmeshjoshi / reactiveio
View on GitHub
☆14May 14, 2019Updated 7 years ago
dingsai88 / StudyTest
View on GitHub
自己学习用
☆11Sep 23, 2020Updated 5 years ago
zheyuan28 / SparkTaskMetrics
View on GitHub
Task Metrics Explorer
☆14Apr 2, 2019Updated 7 years ago
BryanCutler / SparkArrowFlight
View on GitHub
Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients
☆37Mar 9, 2021Updated 5 years ago
xvshu / ZKManager
View on GitHub
Zookeeper management project under the control of simple rights（简单权限控制下的zookeeper管理项目）
☆12Jun 25, 2018Updated 8 years ago
IdanCo / object2form
View on GitHub
Easily convert javascript objects to html forms
☆14Oct 6, 2018Updated 7 years ago
fabiogouw / spark-aws-messaging
View on GitHub
A custom sink provider for Apache Spark that sends the content of a dataframe to an AWS SQS
☆23Feb 19, 2026Updated 5 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
fanlychie / cat-client-mybatis
View on GitHub
MyBatis接入大众点评CAT监控平台
☆11Aug 5, 2018Updated 7 years ago
adsabs / montysolr
View on GitHub
Solr for Astrophysics Data System
☆57Jul 8, 2026Updated 2 weeks ago
subchen / jetbrick-template-2x-samples
View on GitHub
Samples for jetbrick-template-2x
☆10Mar 17, 2017Updated 9 years ago
DomoApps / domo-phoenix
View on GitHub
Build beautiful charts using Domo's powerful charting engine
☆22Jun 25, 2026Updated 3 weeks ago
iheartradio / thomas
View on GitHub
Another A/B test library
☆25Jul 9, 2026Updated last week
MrPowers / bebe
View on GitHub
Filling in the Spark function gaps across APIs
☆50Apr 14, 2021Updated 5 years ago
spoddutur / spark-as-service-using-embedded-server
View on GitHub
This application comes as Spark2.1-as-Service-Provider using an embedded, Reactive-Streams-based, fully asynchronous HTTP server
☆50Jul 16, 2023Updated 3 years ago
mahapatra09 / aflux
View on GitHub
☆10Dec 16, 2022Updated 3 years ago
permanentstar / spark-sql-dsv2-extension
View on GitHub
A sql extension build on spark3 datasource v2 api, ex: hive v2 catalog support amoung multi clusters
☆11May 7, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
not-kennethreitz / pipenv-buildpack
View on GitHub
A minimal buildpack for Pipenv.
☆11Feb 13, 2019Updated 7 years ago
snowplow / scala-maxmind-iplookups
View on GitHub
Scala client for MaxMind Geo-IP
☆87Feb 18, 2026Updated 5 months ago
rtahboub / spark-sql-customized-parser
View on GitHub
An experiment to inject a customized parser using SparkSessionExtension
☆16Jan 1, 2018Updated 8 years ago
singe / linuxkit-for-mac
View on GitHub
A method for building LinuxKit images for Docker-CE with custom kernels.
☆21Aug 3, 2023Updated 2 years ago
starburstdata / presto-minio
View on GitHub
Presto and Minio on Docker Infrastructure
☆43Jul 11, 2018Updated 8 years ago
samueltardieu / serialbridge
View on GitHub
Bridge serial ports and TCP sockets
☆17Sep 21, 2015Updated 10 years ago
awslabs / aws-glue-catalog-sync-agent-for-hive
View on GitHub
Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog
☆37Dec 5, 2023Updated 2 years ago