dremio-hub / dremio-flight-connector
Dremio Flight connector. Access Dremio using Arrow flight
☆40Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for dremio-flight-connector
- Demonstration of a Hive Input Format for Iceberg☆26Updated 3 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆56Updated last year
- ☆13Updated last week
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 7 years ago
- A library for Spark DataFrame using MinIO Select API☆96Updated 5 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆43Updated 9 months ago
- A testing framework for Trino☆26Updated this week
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated 7 months ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆36Updated 3 years ago
- ☆104Updated last year
- ☆39Updated 5 years ago
- Data Catalog for Databases and Data Warehouses☆31Updated 10 months ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- Unity Catalog UI☆39Updated 2 months ago
- Point-in-Time optimizations for Apache Spark☆29Updated 10 months ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Updated 3 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated 2 weeks ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated 6 months ago
- Alluxio Python client - Access Any Data Source with Python☆25Updated 3 weeks ago
- A composable framework for fast and scalable data analytics☆57Updated last year
- Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization s…☆54Updated last year
- Paper: A Zero-rename committer for object stores☆20Updated 3 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆16Updated 3 years ago
- Fybrik platform - Arrow/Flight module☆16Updated 3 months ago
- The Internals of Apache Beam☆12Updated 4 years ago
- ☆24Updated 2 months ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆92Updated 3 weeks ago
- Cask Hydrator Plugins Repository☆67Updated 3 weeks ago
- Scalable CDC Pattern Implemented using PySpark☆18Updated 5 years ago