Data Quality Monitoring Tool
☆15 · Dec 5, 2017 · Updated 8 years ago
Alternatives and similar repositories for data-quality-monitoring
Users that are interested in data-quality-monitoring are comparing it to the libraries listed below.
- Tutorial and examples of Data Quality in Big Data System ☆11 · Apr 25, 2017 · Updated 9 years ago
- Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http:… ☆73 · Jan 1, 2023 · Updated 3 years ago
- Scripts to demonstrate VPC Service Controls between tenant and shared projects ☆12 · Jun 11, 2019 · Updated 7 years ago
- Delta Lake Examples ☆11 · Apr 24, 2020 · Updated 6 years ago
- ☆12 · May 16, 2017 · Updated 9 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase ☆14 · Mar 23, 2016 · Updated 10 years ago
- low-level helpers for Apache Spark libraries and tests ☆16 · Dec 29, 2018 · Updated 7 years ago
- A modern API to get information from the RATP ☆13 · Jul 12, 2023 · Updated 2 years ago
- Ansible scripts for deploying Kafka on EC2 ☆10 · Oct 7, 2016 · Updated 9 years ago
- ☆10 · Jan 28, 2025 · Updated last year
- Data quality tools for Big Data ☆19 · Oct 10, 2019 · Updated 6 years ago
- ☆12 · Jun 1, 2021 · Updated 5 years ago
- File compaction tool that runs on top of the Spark framework. ☆59 · Apr 17, 2019 · Updated 7 years ago
- The one file simple bug tracking application that incorporates a kanban board. ☆12 · Jan 31, 2014 · Updated 12 years ago
- My HackerRank Solutions : https://www.hackerrank.com/RohanKhude ☆12 · Jul 13, 2016 · Updated 9 years ago
- ☆20 · Apr 27, 2012 · Updated 14 years ago
- A cookiecutter template for Apache Spark applications written in Scala ☆10 · Jan 11, 2019 · Updated 7 years ago
- Asynchronous Scala Clients for Amazon Web Services ☆13 · Jul 31, 2017 · Updated 8 years ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue ☆93 · Dec 29, 2022 · Updated 3 years ago
- Resources used in the production of my "Managing Infrastructure With Terraform" course ☆23 · Aug 12, 2020 · Updated 5 years ago
- ☆12 · Mar 15, 2022 · Updated 4 years ago
- NICTA Named Entity Recogniser is a rule based Named Entity Recogniser which extracts named entities from text such as Organisation, Locat… ☆16 · Apr 15, 2023 · Updated 3 years ago
- Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in… ☆19 · Aug 16, 2019 · Updated 6 years ago
- ☆15 · Jan 17, 2022 · Updated 4 years ago
- RATP SDK - Retrieve schedules for any given RER (train), Metro, or Tramway stop in real time ☆23 · Oct 23, 2019 · Updated 6 years ago
- Sample AWS Batch project to read CSV files ☆11 · Oct 22, 2017 · Updated 8 years ago
- ☆13 · Jun 14, 2016 · Updated 10 years ago
- fork from reverse snowflake joins (https://sourceforge.net/projects/revj/) ☆17 · Jun 13, 2021 · Updated 5 years ago
- Workshop for Hadoop Operations Best Practices ☆10 · Feb 24, 2015 · Updated 11 years ago
- Mirror of Apache Spark ☆11 · Apr 30, 2026 · Updated last month
- Demo code contrasting Google Dataflow (Apache Beam) with Apache Spark ☆14 · Sep 1, 2016 · Updated 9 years ago
- Orchestration, Management and Monitoring of Data Processing ☆11 · Updated this week
- CLI for creating databases for Data Quality Dashboards. ☆19 · Oct 26, 2019 · Updated 6 years ago
- Building a real-time alert monitoring pipeline that sends email notifications off of Azure Event Hubs, Azure Databricks, and an Azure Logi… ☆13 · Mar 8, 2020 · Updated 6 years ago
- A module that processes new Edgar filings and sends out notifications ☆14 · Dec 28, 2015 · Updated 10 years ago
- Automatic Text Summarization with Machine Learning ☆15 · Jul 30, 2017 · Updated 8 years ago
- Anything we need to maintain the Linked Open Data (LOD) publication of CEUR-WS.org ☆16 · Jun 10, 2020 · Updated 6 years ago
- Code repository for Learning Apache Spark 2, published by Packt ☆21 · Jan 30, 2023 · Updated 3 years ago
- Detect memory leaks in minutes without a heap dump. ☆17 · Apr 7, 2017 · Updated 9 years ago