Read Delta tables without any Spark
☆47Mar 8, 2024Updated last year
Alternatives and similar repositories for DeltaLakeReader
Users that are interested in DeltaLakeReader are comparing it to the libraries listed below
Sorting:
- Hyperparameter search for AllenNLP - powered by Ray TUNE☆28Mar 6, 2025Updated last year
- Delta Lake Examples☆11Apr 24, 2020Updated 5 years ago
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…☆14May 1, 2024Updated last year
- tidyspark: a tidyverse implementation of SparkR built for simplicity, elegance, and ease of use.☆22Sep 25, 2020Updated 5 years ago
- Import Databricks notebooks as libraries/modules☆15Jun 9, 2022Updated 3 years ago
- A curated list of awesome Databricks resources, including Spark☆22Jun 28, 2024Updated last year
- This repository contains step by step instructions on how to finetune Microsoft's Phi-2 model with your own data.☆23Apr 11, 2024Updated last year
- A Spark Publish/Subscribe NATS Connector☆27Oct 12, 2020Updated 5 years ago
- High-performance data retrieval from Neo4j with Apache Arrow 🏹☆32Jul 14, 2022Updated 3 years ago
- PyTorch Flexible Hash Embeddings☆28Feb 4, 2020Updated 6 years ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆35Dec 5, 2023Updated 2 years ago
- spark-sight: Spark performance at a glance☆10Apr 6, 2023Updated 2 years ago
- Distributed persistent Task Queue running on Dask☆38Apr 23, 2023Updated 2 years ago
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- Magic to help Spark pipelines upgrade☆34Sep 29, 2024Updated last year
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆76Feb 15, 2023Updated 3 years ago
- Repository for the dbt Semantic Layer course☆12Updated this week
- COMS 4111 Project 1☆12Jul 21, 2022Updated 3 years ago
- ☆10Jul 1, 2022Updated 3 years ago
- Starter Repo for a Flask backend and Vuejs frontend using Docker☆10Sep 24, 2018Updated 7 years ago
- Takes a kafka stream into spark, apply transformations and sink into Druid. Everything Dockerised.☆30Sep 29, 2023Updated 2 years ago
- See Apache Kylin Website for a complete description☆30May 28, 2018Updated 7 years ago
- A clean online résumé (CV)☆13Jun 6, 2024Updated last year
- The Snapwell wellpath optimization tool☆11Dec 17, 2024Updated last year
- Reproducible Research in Finse☆10Aug 5, 2020Updated 5 years ago
- This is a list of YAML file examples for Docker, Kubernetes, Ansible. Also includes a Python script.☆10Jan 12, 2021Updated 5 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated last year
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- The official github repo for the open online courses: "Dive into LLMs".☆10Mar 15, 2024Updated last year
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆15Dec 27, 2023Updated 2 years ago
- This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a Clic…☆12May 9, 2024Updated last year
- Hackerank Programming Challenges☆10May 8, 2021Updated 4 years ago
- A native Rust library for Delta Lake, with bindings into Python☆3,160Updated this week
- Databricks Migration Tools☆43May 24, 2021Updated 4 years ago
- pyspark methods to enhance developer productivity 📣 👯 🎉☆683Mar 6, 2025Updated last year
- Collection of AI conferences☆14Aug 16, 2022Updated 3 years ago
- ☆10Nov 11, 2016Updated 9 years ago
- ansible with kubernetes☆10Feb 14, 2023Updated 3 years ago
- The Tweets2013 Internet Archive collection☆10Aug 7, 2020Updated 5 years ago