music-of-the-ainur/almaren-framework

The Almaren Framework provides a simplified, consistent, minimalistic layer over Apache Spark while still allowing you to take advantage of native Apache Spark features. You can also combine it with standard Spark code.

☆31

Alternatives and similar repositories for almaren-framework

Users that are interested in almaren-framework are comparing it to the libraries listed below.

apache / datafusion-benchmarks
Apache DataFusion Benchmarks
☆23 · May 2, 2026 · Updated 2 months ago
criteo / vizsql
Scala and SQL happy together.
☆29 · Dec 13, 2016 · Updated 9 years ago
supermariolabs / spooq
☆39 · Jun 17, 2026 · Updated last month
aws-samples / aws-lakeformation-access-controls-automation
☆20 · Aug 10, 2021 · Updated 4 years ago
MIT-LCP / 2019_toronto_health_hack
2019 Toronto Datathon https://www.tdothealthhack.com
☆11 · Oct 4, 2019 · Updated 6 years ago
meisheep / taiwan-pm2_5-idw-map-on-windyty
A Chrome extension that draws PM2.5 IDW diagram data of Taiwan on Windy.com
☆12 · Nov 29, 2017 · Updated 8 years ago
chgl / kube-powertools
An always up to date collection of useful tools for your Kubernetes linting and auditing needs.
☆16 · Updated this week
imposter-project / imposter-cli
CLI for the Imposter mock engine, a scriptable, multipurpose mock server.
☆19 · Updated this week
apache / kyuubi-client
Client libraries of end users of Apache Kyuubi
☆11 · May 15, 2026 · Updated 2 months ago
noetl / noetl
Automation, Data Mash, Message Learning, AI Ops, Quantum Ops
☆14 · Updated this week
leanderloew / ES-RNN-Pytorch
This is a work-in-progress PyTorch implementation of the recently proposed ES-RNN by Slawek Smyl, winner of the M4 competition
☆12 · Apr 9, 2019 · Updated 7 years ago
TrainingByPackt / Serverless-Architectures-with-AWS
Discover how you can migrate from traditional deployments to serverless architectures with AWS
☆12 · Feb 1, 2019 · Updated 7 years ago
GoogleCloudPlatform / healthcare-api-dicom-fuse
FUSE plugin for the Google Cloud Healthcare DICOM API
☆18 · Oct 4, 2023 · Updated 2 years ago
criteo / berilia
Create a Hadoop cluster in AWS EC2 for development
☆11 · Sep 8, 2017 · Updated 8 years ago
rss161030 / ETL-processes-using-Sqoop-Hadoop-Hive-Spark-and-Scala
I implemented various ETL processes like loading the data using sqoop from mysql to hdfs, transform the data using Spark and Scala, perfo…
☆10 · Oct 20, 2017 · Updated 8 years ago
stanch / zipper
An implementation of Huet’s Zipper for Scala and Scala.js that is intended to be usable in many common scenarios
☆49 · Aug 18, 2024 · Updated last year
paulmw / hive-udf
☆16 · Apr 17, 2014 · Updated 12 years ago
aws-samples / apache-xtable-on-aws-samples
☆11 · Jun 8, 2026 · Updated last month
arkady-emelyanov / pyarrow-flight
Apache Arrow Flight example
☆10 · Nov 9, 2020 · Updated 5 years ago
RohanAdwankar / share-df
Python Package to Share/Edit Pandas/Polars DF with web interface!
☆11 · Jun 21, 2026 · Updated last month
alekseyig / spark-submit-deps
☆14 · Jan 12, 2017 · Updated 9 years ago
erik / vxsv
Pager for tabular data and SQL output
☆12 · Mar 29, 2023 · Updated 3 years ago
bigdata-icict / ETL-Dataiku-DSS
☆18 · Dec 18, 2019 · Updated 6 years ago
rhodesrt / ML_exercises
Associated blog post - https://tristanrhodes.com/blog/Adventures-in-Algorithmic-Trading-on-the-Runescape-Grand-Exchange
☆10 · Oct 14, 2024 · Updated last year
PacktPublishing / GCP-Complete-Google-Data-Engineer-and-Cloud-Architect-Guide-v-
Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt
☆16 · Jan 30, 2023 · Updated 3 years ago
marco-roy / DDO
A DBT package to perform DataOps & administrative CI/CD on your data warehouse.
☆16 · May 11, 2021 · Updated 5 years ago
oap-project / sql-ds-cache
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
☆37 · Jan 3, 2023 · Updated 3 years ago
cerndb / SparkPlugins
Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are in…
☆96 · May 11, 2026 · Updated 2 months ago
netease-bigdata / ne-spark-courseware
NetEase Spark Courses
☆15 · Sep 4, 2018 · Updated 7 years ago
bonn0062 / flask_model_deployment
This is the official repo for the Heartbeat article, "The brilliant beginner's guide to model deployment: a clear and simple roadmap for …
☆20 · Feb 22, 2019 · Updated 7 years ago
Manouchehri / presage
An intelligent predictive text entry platform. Mirror of git://git.code.sf.net/p/presage/presage Please send reports to the SourceForge b…
☆11 · Aug 17, 2015 · Updated 10 years ago
bartosz25 / acid-file-formats
Code for Apache Hudi, Apache Iceberg and Delta Lake analysis
☆10 · Feb 2, 2024 · Updated 2 years ago
mhausenblas / hadoop-data-ingestion
Renders options for ingesting data into Hadoop
☆21 · Jun 18, 2013 · Updated 13 years ago
databricks-industry-solutions / omop-cdm
Unlocking the Power of Health Data With a Modern Data Lakehouse
☆29 · Mar 29, 2026 · Updated 3 months ago
open-metadata / openmetadata-sqllineage
SQL Lineage Analysis Tool powered by Python
☆20 · Aug 25, 2023 · Updated 2 years ago
informagi / GeeseDB
Graph Engine for Exploration and Search
☆42 · Jan 26, 2024 · Updated 2 years ago
adidas / datamesh-sharing-data-at-scale
adidas Data Mesh implementation
☆12 · May 13, 2022 · Updated 4 years ago
geordielad / tableau-athena-credential-provider-examples
How to customize Tableau authentication using AWS Athena's JDBC Credentials Provider capabilities.
☆14 · Jun 8, 2020 · Updated 6 years ago
alexwlchan / concurrently
A snippet for running multiple, concurrent invocations of a Python function
☆24 · May 17, 2026 · Updated 2 months ago