Parquet file generator
☆22Apr 17, 2018Updated 8 years ago
Alternatives and similar repositories for parquet-generator
Users that are interested in parquet-generator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Albis: High-Performance File Format for Big Data Systems☆21Jul 12, 2018Updated 7 years ago
- Code examples for my blog posts☆22Nov 7, 2018Updated 7 years ago
- ☆10Jun 28, 2011Updated 14 years ago
- 基于多线程与epoll的高并发TCP服务器☆11Aug 4, 2018Updated 7 years ago
- A Python library and command line utility for manipulating and plotting stellar lightcurves.☆10Jun 14, 2016Updated 10 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Spark* Shuffle plugin for support shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote pe…☆14Sep 18, 2023Updated 2 years ago
- SBT template project for creating Scala (micro-)benchmarks based on Caliper☆20May 2, 2012Updated 14 years ago
- benchmark-for-spark☆18May 7, 2025Updated last year
- Text Preprocessing in Python☆19Jan 15, 2017Updated 9 years ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …☆22Feb 6, 2017Updated 9 years ago
- ☆15Jan 21, 2023Updated 3 years ago
- Bridging Immutable and Mutable Abstractions for Distributed Data Analytics☆12May 15, 2019Updated 7 years ago
- ☆12Jul 18, 2025Updated 11 months ago
- Demo code for implementing and showcasing a Fraud Detection Engine with Apache Flink.☆33Oct 20, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- AWS Cost Monitoring terraform module☆17Nov 30, 2022Updated 3 years ago
- Open-Channel SSD emulator using memory☆22Nov 1, 2017Updated 8 years ago
- A GameBoy Emulator written in Rust, written as a learning project for both☆10Jun 6, 2023Updated 3 years ago
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆131Dec 19, 2024Updated last year
- A set of tools for understanding F2FS usage of ZNS devices, which allow for identifying the on-device locations of files and inodes, mapp…☆20Jan 19, 2025Updated last year
- A versioned database inspired by Git☆16Dec 16, 2017Updated 8 years ago
- Framework for running macro benchmarks in a clustered environment☆25Aug 29, 2022Updated 3 years ago
- A research and review of techniques to provide a natural language interface to RDMS.☆10Dec 8, 2017Updated 8 years ago
- Python Repository of the Institute of Astronomy @ KU Leuven☆20Nov 5, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A GUI application for testing GRPC services☆18Nov 20, 2023Updated 2 years ago
- Utility to recursively scrape ArcGIS MapServer data using the ArcGIS REST API.☆11Jun 10, 2016Updated 10 years ago
- A Spark datasource for the HadoopCryptoLedger library☆13Sep 29, 2025Updated 8 months ago
- Jpak compression format☆15Mar 12, 2017Updated 9 years ago
- Fast, reliable, and scalable channels implementation based on Redis streams.☆11Jun 25, 2024Updated last year
- A public repo that contains integrations for Argilla and LlamaIndex.☆17Oct 10, 2024Updated last year
- A simple golang job queue☆13Jan 19, 2023Updated 3 years ago
- A multi-thread Redis implementation with RCU☆18May 22, 2025Updated last year
- Apache Hadoop HDFS Data Node Scheduler☆13Jun 4, 2016Updated 10 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Material for the CorrelCon 2020 Advanced Session "Building a modularized Shiny app with the golem 📦 and html widgets"☆12Mar 10, 2021Updated 5 years ago
- A simple query library for scala and the Google data store.☆29Aug 10, 2011Updated 14 years ago
- Linux kernel SGX driver for Graphene☆12Nov 3, 2020Updated 5 years ago
- An exploration of Flink and change-data-capture via flink-cdc-connectors☆11Jul 7, 2021Updated 4 years ago
- Spark Terasort☆121Apr 21, 2023Updated 3 years ago
- opencloud with podman quadlets☆20Oct 21, 2025Updated 7 months ago
- ☆64Nov 8, 2019Updated 6 years ago