YelpArchive / data_pipeline
Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.
☆110 · Aug 17, 2022 · Updated 3 years ago
Alternatives and similar repositories for data_pipeline
Users interested in data_pipeline are comparing it to the libraries listed below.
- Provides a Pythonic interface for reading and writing Avro schemas · ☆27 · Aug 17, 2022 · Updated 3 years ago
- A schema store service that tracks and manages all the schemas used in the Data Pipeline · ☆88 · Mar 2, 2021 · Updated 4 years ago
- An extension of the kafka-python package that adds features like multiprocess consumers. · ☆39 · Aug 24, 2023 · Updated 2 years ago
- MySQLStreamer is a database change data capture and publish system. · ☆411 · Aug 17, 2022 · Updated 3 years ago
- Marquez Web UI · ☆21 · Nov 13, 2020 · Updated 5 years ago
- Interfaces and shared infrastructure for generic task processing at Yelp. · ☆23 · Sep 4, 2025 · Updated 5 months ago
- Skeleton project for Apache Airflow training participants to work on. · ☆17 · Jul 9, 2020 · Updated 5 years ago
- Astronomer Vendor Images · ☆17 · Jan 28, 2026 · Updated 2 weeks ago
- Automation of desktop, web, mainframe and Citrix-based processes using RPA tools such as BluePrism, PegaRobotics, Automation Anywhere and … · ☆11 · Dec 9, 2017 · Updated 8 years ago
- ☆42 · Aug 17, 2022 · Updated 3 years ago
- A homebrewed cyber threat intelligence solution · ☆20 · Nov 20, 2012 · Updated 13 years ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde… · ☆16 · Oct 1, 2022 · Updated 3 years ago
- Jupyter Integration for MATLAB using VNC · ☆16 · Sep 26, 2025 · Updated 4 months ago
- A dbt adapter for TiDB · ☆14 · Dec 14, 2023 · Updated 2 years ago
- 🚦 SpeedTracker API layer · ☆13 · Jan 19, 2019 · Updated 7 years ago
- Solidity (Ethereum smart contract) language support for Blockly · ☆21 · Aug 21, 2019 · Updated 6 years ago
- Repeatable, evidence-based expense and income accounting · ☆41 · Sep 16, 2025 · Updated 4 months ago
- Streaming Data Simulator · ☆17 · Oct 12, 2020 · Updated 5 years ago
- Monitors Docker Swarm services and sends a Pushover notification if any service is down · ☆22 · Nov 27, 2019 · Updated 6 years ago
- React version of dbt labs dbt docs app · ☆17 · Nov 14, 2021 · Updated 4 years ago
- Some tools for dealing with IRC in Scala · ☆24 · May 26, 2015 · Updated 10 years ago
- ☆17 · Sep 27, 2022 · Updated 3 years ago
- Generate a Go file with the output of compile time shell commands · ☆19 · May 17, 2021 · Updated 4 years ago
- Kubernetes cifs volume plugin · ☆24 · Mar 8, 2017 · Updated 8 years ago
- Display version and compression information about a parquet file · ☆25 · Feb 3, 2026 · Updated last week
- Next generation batch process scheduling and management · ☆352 · Updated this week
- TeraSort for Spark and Flink which uses a range partitioner based on sampling · ☆22 · Feb 5, 2016 · Updated 10 years ago
- A set of py.test fixtures for AWS Chalice · ☆21 · Jul 1, 2020 · Updated 5 years ago
- This repo contains the content for the CFCR documentation. · ☆19 · Feb 25, 2021 · Updated 4 years ago
- A high performance replicated log service. (The development is moved to Apache Incubator) · ☆2,208 · Feb 25, 2020 · Updated 5 years ago
- ☆10 · Sep 7, 2021 · Updated 4 years ago
- A Github API client to extract events and actions, and load into a database · ☆28 · Oct 22, 2021 · Updated 4 years ago
- Dynamic ORM operations using pydantic · ☆29 · Feb 28, 2023 · Updated 2 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka · ☆29 · Jun 8, 2016 · Updated 9 years ago
- Python client for Elasticsearch Watcher (deprecated) · ☆23 · Jun 4, 2018 · Updated 7 years ago
- Kafka Connect Connector for Jenkins Open Source Continuous Integration Tool · ☆31 · Nov 15, 2022 · Updated 3 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga… · ☆2,260 · Jan 15, 2026 · Updated 3 weeks ago
- AWS Terraform Module to manage SSM Patch Management Resources · ☆10 · May 17, 2022 · Updated 3 years ago
- Internal API for call processing · ☆11 · Jan 30, 2026 · Updated 2 weeks ago