Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL
☆97Jan 22, 2026Updated last month
Alternatives and similar repositories for spark-sas7bdat
Users that are interested in spark-sas7bdat are comparing it to the libraries listed below
- Lightweight Java library designed to read SAS7BDAT datasets☆76Jan 26, 2026Updated last month
- gsp-sqlparser☆13Mar 28, 2018Updated 7 years ago
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆46Jan 27, 2025Updated last year
- A Spark datasource for the HadoopOffice library☆36Sep 29, 2025Updated 5 months ago
- A Java Based Query Engine For JSON☆12Nov 23, 2014Updated 11 years ago
- agogosml is a flexible data processing pipeline that addresses the common need for operationalizing ML models at scale☆34May 3, 2019Updated 6 years ago
- Drag'n'Drop TabPane extension for standard JavaFX☆12Nov 12, 2024Updated last year
- A super lightweight yet powerful Object Relational Mapping (ORM) Framework for Java and SQLite 3☆12May 17, 2014Updated 11 years ago
- Pandas-aware non-linear least squares regression using Lmfit☆10Aug 15, 2016Updated 9 years ago
- Example code for doing DataOps☆49Jan 26, 2021Updated 5 years ago
- This is a GUI template for JavaFX projects. It has a basic layout already created and a custom logging feature.☆11May 28, 2018Updated 7 years ago
- customized cloudera-parcel☆13Feb 3, 2019Updated 7 years ago
- Spark Structured Streaming State Tools☆34Jul 3, 2020Updated 5 years ago
- ******* In this fork I only work on the r/ directory, please refer to the upstream repo for all of Arrow******☆15Feb 3, 2022Updated 4 years ago
- Use Python within Stata☆19Jul 28, 2014Updated 11 years ago
- A parser for Oracle PL/SQL written in Java. Complete for packages, but no conditional compilation.☆17Jun 15, 2019Updated 6 years ago
- Kafka sink for Kusto☆51Mar 2, 2026Updated last week
- Sample SAS programs that use SAS/ACCESS engines to connect to your data source. Use these for testing and for learning!☆26Nov 7, 2024Updated last year
- R package for dataset generation and benchmarking☆22Jan 20, 2020Updated 6 years ago
- Cookie cutter for JupyterLab mimerenderer extensions using TypeScript☆20Aug 8, 2023Updated 2 years ago
- 🔄 A command-line utility to export Protocol Buffers (proto) files to YAML and JSON☆19Apr 25, 2024Updated last year
- R Package for WebHDFS REST API☆18Apr 15, 2019Updated 6 years ago
- Demonstrates calling a Scala UDF from Python using spark-submit with an EGG and JAR☆23Mar 3, 2020Updated 6 years ago
- Interface to the boilerpipe Java library by Christian Kohlschutter (http://code.google.com/p/boilerpipe/)☆21May 19, 2021Updated 4 years ago
- Major mode for editing Stata files in Emacs☆18Feb 10, 2026Updated 3 weeks ago
- ☆23Dec 10, 2019Updated 6 years ago
- Dashboard implementation for Shiny☆45Jan 19, 2020Updated 6 years ago
- Standard Dialogs for JavaFX 2☆49Apr 6, 2017Updated 8 years ago
- Data-driven software (python implementation)☆26Oct 20, 2023Updated 2 years ago
- A flake8 plugin that detects usage of withColumn in a loop or inside reduce☆28Jun 20, 2025Updated 8 months ago
- Stata module to add U.S. state identifiers to dataset.☆23Feb 5, 2022Updated 4 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Jun 17, 2025Updated 8 months ago
- A reusable component framework for visualization and data manipulation.☆20Mar 2, 2019Updated 7 years ago
- Tools for Standardizing Variables for Regression in R☆22Mar 5, 2021Updated 5 years ago
- PyTorch Flexible Hash Embeddings☆28Feb 4, 2020Updated 6 years ago
- R for Reproducible Research tutorial☆28Jun 13, 2017Updated 8 years ago
- Out-of-core statistical computing and signal processing☆60Feb 18, 2026Updated 2 weeks ago
- Automagically Convert purrr R Calls to Efficient Julia for Loops☆47Mar 31, 2019Updated 6 years ago
- spark-sight: Spark performance at a glance☆10Apr 6, 2023Updated 2 years ago