Read Delta tables without any Spark
☆47Mar 8, 2024Updated 2 years ago
Alternatives and similar repositories for DeltaLakeReader
Users that are interested in DeltaLakeReader are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- tidyspark: a tidyverse implementation of SparkR built for simplicity, elegance, and ease of use.☆22Sep 25, 2020Updated 5 years ago
- fsspec-compatible Azure Blob and Data Lake Storage (Gen2) access☆207Apr 9, 2026Updated 3 weeks ago
- Redash plugin for Apache Kylin integration☆12Mar 21, 2018Updated 8 years ago
- A curated list of awesome Databricks resources, including Spark☆22Jun 28, 2024Updated last year
- Databricks Migration Tools☆43May 24, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Hyperparameter search for AllenNLP - powered by Ray TUNE☆28Mar 6, 2025Updated last year
- Simple, asynchronous job queueing library for Python on top of NATS.io☆17Sep 4, 2025Updated 8 months ago
- Import Databricks notebooks as libraries/modules☆15Jun 9, 2022Updated 3 years ago
- This repository contains step by step instructions on how to finetune Microsoft's Phi-2 model with your own data.☆23Apr 11, 2024Updated 2 years ago
- Artfully create commit messages that reflect the essence of your code changes. Craftsmanship for your commits.☆20Jun 8, 2025Updated 10 months ago
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- Python Client for Microsoft Project Oxford☆10Jun 7, 2016Updated 9 years ago
- Python Localstack Examples☆11Mar 3, 2026Updated 2 months ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆76Feb 15, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Project repository of Apache Airflow, deployed on Docker in Amazon EC2 via GitLab.☆15Sep 3, 2021Updated 4 years ago
- Data sources for Elastic Map Service☆23Updated this week
- A Table format agnostic data sharing framework☆42Feb 4, 2024Updated 2 years ago
- VSCode extension to work with Databricks☆134Mar 31, 2026Updated last month
- High-level HTTP clients for Python.☆17Apr 14, 2026Updated 3 weeks ago
- ☆11Sep 23, 2019Updated 6 years ago
- ☆10Jul 22, 2021Updated 4 years ago
- Dynamic dispatch over arbitrary predicates☆10Feb 2, 2016Updated 10 years ago
- Converts keras trained models to frozen tensorflow protocol buffers for use with the c++ tensorflow api☆10Sep 28, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Feature selection for machine learning using mutual information.☆15Dec 4, 2024Updated last year
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆19Jun 11, 2025Updated 10 months ago
- scripts for using spark on janelia's cluster☆12Nov 12, 2023Updated 2 years ago
- Python module to align a simple (not nested) list in columns. Adapted from the routine of the same name inside cmd.py☆16Dec 12, 2025Updated 4 months ago
- Unofficial Python client for Azure cognitive search☆11Jun 7, 2019Updated 6 years ago
- Reproducible Research in Finse☆10Aug 5, 2020Updated 5 years ago
- ☆11Jul 13, 2021Updated 4 years ago
- Sample Python codes and code snippets on how to use Azure Cognitive Services APIs such as face recognition, text analytics, topic detecti…☆13Sep 19, 2017Updated 8 years ago
- A curated list of just the right amount of resources on causal inference.☆14Oct 28, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Python interface to Mollie.nl iDEAL API for use in Django projects.☆19Nov 20, 2017Updated 8 years ago
- Code that was used as an example during the Data+AI Summit 2020☆15Mar 8, 2021Updated 5 years ago
- Example gaming leaderboard application covering streaming ingestion, CDC enrichment, processing and visualisation including demo of advan…☆21Nov 18, 2025Updated 5 months ago
- High-performance data retrieval from Neo4j with Apache Arrow 🏹☆32Jul 14, 2022Updated 3 years ago
- Yet Another (Spark) ETL Framework☆21Oct 21, 2023Updated 2 years ago
- ResumeItNow is a free, open-source resume builder that helps job seekers create professional resumes without watermarks or hidden fees. B…☆26Feb 10, 2026Updated 2 months ago
- VPS Setup Script - User Management, Security, Docker, System Updates, Coolify Installation☆39Jan 16, 2025Updated last year