PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection
☆17Jan 12, 2017Updated 9 years ago
Alternatives and similar repositories for pyspark-atlas
Users that are interested in pyspark-atlas are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- json files and rest calls to add custom atlas types and create entities☆12Mar 27, 2017Updated 9 years ago
- Examples of cellulose projects☆13Oct 19, 2015Updated 10 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- Ambari stack service for easily installing and managing Solr on HDP cluster☆18Nov 13, 2018Updated 7 years ago
- This project compose of two parts: 1) write, spark job to write to hbase using bulk load to; 2)read, rest api reading from hbase base on …☆20Oct 25, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- minio as local storage and DynamoDB as catalog☆15May 14, 2024Updated last year
- Simple role for deploying Elixir Exrm releases.☆10Jan 28, 2016Updated 10 years ago
- Adventures in robotics with Mindstorm EV3 and Elixir☆12Dec 30, 2019Updated 6 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Jan 11, 2017Updated 9 years ago
- golang tools for Apache Solr☆28Jan 29, 2026Updated 3 months ago
- Test pages listed in a sitemap.☆10Jan 7, 2015Updated 11 years ago
- SBT template for projects written in Scala and other JVM languages☆13Dec 29, 2021Updated 4 years ago
- pivottablejs for air-gapped systems☆13Aug 14, 2024Updated last year
- A starter template for creating web applications with Google Apps Script & Svelte☆10Oct 20, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Kafka Connect Converter using JSONSchema☆15Oct 5, 2022Updated 3 years ago
- Example project demonstrating easy, concise and typechecked JDBC access☆10Feb 9, 2018Updated 8 years ago
- RedRock - Mobile Application prototype using Apache Spark, Twitter and Elasticsearch☆14Sep 10, 2018Updated 7 years ago
- Radio is a DuckDB extension by Query.Farm that brings real-time event streams into your SQL workflows. It enables DuckDB to receive and s…☆38Mar 29, 2026Updated last month
- ipywidgets GUI elements for HyperSpy☆11May 1, 2026Updated last week
- ERPL is a DuckDB extension to connect to API based ecosystems via standard interfaces like OData, GraphQL and REST. This works e.g. for S…☆27May 2, 2026Updated last week
- A collection of python utility functions☆11Apr 30, 2026Updated last week
- A rails tagging gem implementing flickr's machine tags + maybe more (semantic tags)☆44May 1, 2013Updated 13 years ago
- Postgres protocol support for finagle☆36Sep 4, 2013Updated 12 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- fuzzy matchers for chai, based on underscore☆24Mar 17, 2016Updated 10 years ago
- Amp Rack Guitar Effects Processor for Linux and Windows☆18Mar 26, 2025Updated last year
- Samples of authenticating to an Azure Key Vault vault☆13May 10, 2022Updated 3 years ago
- ☆21May 5, 2016Updated 10 years ago
- A implementation of the Self-Tuning Spectral Clustering algorithm, and more.☆12Sep 4, 2016Updated 9 years ago
- Apache Solr interpreter for Apache Zeppelin☆28Jun 14, 2023Updated 2 years ago
- Golang Zipkin Tracing Client☆18Jan 8, 2018Updated 8 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Oct 14, 2019Updated 6 years ago
- ansible plugins used by xiaomi☆10Oct 13, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Cloudformation and SQL scripts used to replicate a POC environment from the "Data Lake to Data Warehouse: Enhancing Customer 360 with Ama…☆31Feb 20, 2020Updated 6 years ago
- A Spark datasource for the HadoopOffice library☆36Sep 29, 2025Updated 7 months ago
- Ambari Service for OpenTSDB☆34Dec 14, 2016Updated 9 years ago
- ☆15Apr 13, 2026Updated 3 weeks ago
- Automated log shipper for Kubernetes powered by annotations☆10Sep 10, 2019Updated 6 years ago
- Mozilla Foundation DevOps Plans, Issues, Discussions☆14Dec 8, 2022Updated 3 years ago
- Scala Exercises' lessons for the Doobie library☆15Mar 31, 2023Updated 3 years ago