The Almaren Framework provides a simplified consistent minimalistic layer over Apache Spark. While still allowing you to take advantage of native Apache Spark features. You can still combine it with standard Spark code.
☆31Jun 18, 2025Updated 9 months ago
Alternatives and similar repositories for almaren-framework
Users that are interested in almaren-framework are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Quenya-DSL(Domain Specific Language) is a language that simplifies the task to parser complex semi-structured data☆10Sep 22, 2023Updated 2 years ago
- Emacs config☆10Updated this week
- Apache DataFusion Benchmarks☆21Mar 3, 2026Updated last month
- Framework for Parsing and Formatting POD☆44Feb 8, 2026Updated 2 months ago
- Scala and SQL happy together.☆29Dec 13, 2016Updated 9 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A generic ETL framework with Spark_SQL for transforming data by constructing pipelines with Yaml/Json/Xml☆20Feb 3, 2026Updated 2 months ago
- ☆20Aug 10, 2021Updated 4 years ago
- An implementation of the Perl programming language designed to run on the Java platform☆53Updated this week
- Artifacts of the EKGF Data Product Workgroup (DPROD)☆36Apr 1, 2026Updated last week
- A checklist for releasing a CPAN module☆16Jan 25, 2026Updated 2 months ago
- Bring new language features and popular DSLs into cperl-mode☆23Jan 26, 2026Updated 2 months ago
- Resources for security engineer job search.☆11Jan 25, 2026Updated 2 months ago
- Use the Amazon S3 - Simple Storage Service from Perl☆13Aug 17, 2024Updated last year
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆10Feb 13, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- colored pretty-print of Perl data structures and objects☆100Jul 31, 2024Updated last year
- A chrome extension draws pm2.5 IDW diagram data of Taiwan on Windy.com☆12Nov 29, 2017Updated 8 years ago
- ☆16Apr 1, 2026Updated last week
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 3 years ago
- Client libraries of end users of Apache Kyuubi☆11Jan 10, 2023Updated 3 years ago
- An always up to date collection of useful tools for your Kubernetes linting and auditing needs.☆16Updated this week
- CLI for the Imposter mock engine, a scriptable, multipurpose mock server.☆18Updated this week
- A Vim script to return info about the Git branches.☆69Jul 31, 2015Updated 10 years ago
- Discover how you can migrate from traditional deployments to serverless architectures with AWS☆12Feb 1, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- FUSE plugin for the Google Cloud Healthcare DICOM API☆18Oct 4, 2023Updated 2 years ago
- Java Code Isolation☆17Apr 26, 2021Updated 4 years ago
- A neural network written completely in jq☆17Apr 30, 2017Updated 8 years ago
- Apache Arrow Flight example☆11Nov 9, 2020Updated 5 years ago
- A simple dsl for criteria and hql with scala☆19Aug 30, 2011Updated 14 years ago
- ☆11Nov 26, 2024Updated last year
- I implemented various ETL processes like loading the data using sqoop from mysql to hdfs, transform the data using Spark and Scala, perfo…☆10Oct 20, 2017Updated 8 years ago
- A snippet for running multiple, concurrent invocations of a Python function☆24Mar 23, 2026Updated 2 weeks ago
- C# LZW Decoder Library☆14Aug 12, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Pager for tabular data and SQL output☆12Mar 29, 2023Updated 3 years ago
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt☆16Jan 30, 2023Updated 3 years ago
- Spark* Shuffle plugin for support shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote pe…☆14Sep 18, 2023Updated 2 years ago
- A DBT package to perform DataOps & administrative CI/CD on your data warehouse.☆16May 11, 2021Updated 4 years ago
- Convenient pyarrow operations following the Pandas API☆45Jan 30, 2022Updated 4 years ago
- ☆14Jan 12, 2017Updated 9 years ago
- kdevops main repository: Generalized devops infrastructure for Linux kernel development☆32Jan 8, 2026Updated 3 months ago