jeppe742 / DeltaLakeReaderView external linksLinks
Read Delta tables without any Spark
☆47Mar 8, 2024Updated last year
Alternatives and similar repositories for DeltaLakeReader
Users that are interested in DeltaLakeReader are comparing it to the libraries listed below
Sorting:
- Redash plugin for Apache Kylin integration☆12Mar 21, 2018Updated 7 years ago
- DataFuse operator manages fuse-query and fuse-store clusters atop Kubernetes using CRDs.☆13Jul 4, 2022Updated 3 years ago
- ☆14Updated this week
- Delta Lake Examples☆11Apr 24, 2020Updated 5 years ago
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…☆14May 1, 2024Updated last year
- Import Databricks notebooks as libraries/modules☆15Jun 9, 2022Updated 3 years ago
- This project demonstrates how to integrate DuckLake, SQLMesh, and Neon PostgreSQL to create a modern data lakehouse architecture with ver…☆27Jun 3, 2025Updated 8 months ago
- ☆16Oct 17, 2024Updated last year
- A curated list of awesome Databricks resources, including Spark☆22Jun 28, 2024Updated last year
- This repository contains step by step instructions on how to finetune Microsoft's Phi-2 model with your own data.☆22Apr 11, 2024Updated last year
- PyTorch Flexible Hash Embeddings☆28Feb 4, 2020Updated 6 years ago
- High-performance data retrieval from Neo4j with Apache Arrow 🏹☆32Jul 14, 2022Updated 3 years ago
- dbt Cloud pipelines in airflow examples☆37Oct 30, 2023Updated 2 years ago
- spark-sight: Spark performance at a glance☆10Apr 6, 2023Updated 2 years ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆35Dec 5, 2023Updated 2 years ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Dec 31, 2024Updated last year
- ☆31Oct 14, 2019Updated 6 years ago
- Distributed persistent Task Queue running on Dask☆38Apr 23, 2023Updated 2 years ago
- Magic to help Spark pipelines upgrade☆34Sep 29, 2024Updated last year
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- Repository for the dbt Semantic Layer course☆11Nov 13, 2025Updated 3 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Feb 1, 2026Updated last week
- Raw image editor with built-in film emulation.☆17Updated this week
- COMS 4111 Project 1☆12Jul 21, 2022Updated 3 years ago
- ☆10Jul 1, 2022Updated 3 years ago
- See Apache Kylin Website for a complete description☆30May 28, 2018Updated 7 years ago
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆15Dec 27, 2023Updated 2 years ago
- A clean online résumé (CV)☆13Jun 6, 2024Updated last year
- This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a Clic…☆12May 9, 2024Updated last year
- ☆17May 27, 2025Updated 8 months ago
- A Streamlit-based chatbot application using Gemini models for NLP. Features include light/dark mode toggle, model selection (Gemini 1.5 F…☆10May 23, 2024Updated last year
- This is a list of YAML file examples for Docker, Kubernetes, Ansible. Also includes a Python script.☆10Jan 12, 2021Updated 5 years ago
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- Delve is a debugger for the Go programming language.☆11Apr 9, 2023Updated 2 years ago
- DeepAlign: Alignment-based Process Anomaly Correction Using Recurrent Neural Networks☆10Mar 25, 2023Updated 2 years ago
- A native Rust library for Delta Lake, with bindings into Python☆3,135Updated this week
- Databricks Migration Tools☆43May 24, 2021Updated 4 years ago
- pyspark methods to enhance developer productivity 📣 👯 🎉☆682Mar 6, 2025Updated 11 months ago
- Spark implementation of Slowly Changing Dimension type 2☆11Jan 8, 2019Updated 7 years ago