airbnb / airbnb-spark-thriftLinks
A library for loadling Thrift data into Spark SQL
☆43Updated 2 years ago
Alternatives and similar repositories for airbnb-spark-thrift
Users that are interested in airbnb-spark-thrift are comparing it to the libraries listed below
Sorting:
- A dynamic graph-based metric computation engine.☆49Updated 2 years ago
- Apache Superset (incubating) is a modern, enterprise-ready business intelligence web application☆102Updated 2 years ago
- IntelliJ IDEA code style settings for Airbnb's Java and Android projects.☆40Updated 10 years ago
- Send Kafka Metrics to StatsD.☆135Updated 4 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆279Updated 6 years ago
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.☆70Updated 2 years ago
- KafkaT-ool☆503Updated 6 years ago
- DBeam exports SQL tables into Avro files using JDBC and Apache Beam☆194Updated this week
- Google BigQuery support for Spark, SQL, and DataFrames☆155Updated 5 years ago
- Live-updating Spark UI built with Meteor☆189Updated 4 years ago
- ☆70Updated 8 years ago
- Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code☆296Updated 8 months ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆240Updated 10 years ago
- Reversible conversions between types☆658Updated 10 months ago
- Mirror of Apache Crunch (Incubating)☆108Updated 4 years ago
- A Scala feature transformation library for data science and machine learning☆469Updated 8 months ago
- A tool for data sampling, data generation, and data diffing☆344Updated 5 months ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆17Updated 4 years ago
- Library for organizing batch processing pipelines in Apache Spark☆42Updated 8 years ago
- hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format☆127Updated 3 years ago
- File compaction tool that runs on top of the Spark framework.☆59Updated 6 years ago
- PostgreSQL protocol gateway for Presto distributed SQL query engine☆292Updated 2 years ago
- Druid indexing plugin for using Spark in batch jobs☆101Updated 3 years ago
- Change Data Capture (CDC) service☆447Updated last year
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in…☆254Updated 3 months ago
- A custom AWS credential provider that allows your Hadoop or Spark application access S3 file system by assuming a role☆10Updated 8 years ago
- Hadoop Data Pipeline using Falcon☆15Updated 9 years ago
- Hive + Avro. Serde for working with Avro in Hive☆59Updated last year
- A connector for SingleStore and Spark☆162Updated 3 weeks ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆93Updated 5 years ago