Parquet file generator
☆22Apr 17, 2018Updated 7 years ago
Alternatives and similar repositories for parquet-generator
Users that are interested in parquet-generator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Albis: High-Performance File Format for Big Data Systems☆21Jul 12, 2018Updated 7 years ago
- Code examples for my blog posts☆22Nov 7, 2018Updated 7 years ago
- 基于多线程与epoll的高并发TCP服务器☆11Aug 4, 2018Updated 7 years ago
- A Python library and command line utility for manipulating and plotting stellar lightcurves.☆10Jun 14, 2016Updated 9 years ago
- Spark* Shuffle plugin for support shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote pe…☆14Sep 18, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Memory Disaggregation on POWER9 with OpenCAPI 3.0 M1 & C1☆38Dec 2, 2021Updated 4 years ago
- benchmark-for-spark☆18May 7, 2025Updated 10 months ago
- Text Preprocessing in Python☆19Jan 15, 2017Updated 9 years ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …☆22Feb 6, 2017Updated 9 years ago
- A research group at UCSD CSE focused on Advanced Data Analytics: data management and systems for ML/AI and data science.☆11Feb 27, 2026Updated last month
- Presto connector for Apache Kudu☆48Mar 22, 2019Updated 7 years ago
- native Rust implementation of Kafka protocol and api☆14Jun 13, 2023Updated 2 years ago
- Open-Channel SSD emulator using memory☆22Nov 1, 2017Updated 8 years ago
- A GameBoy Emulator written in Rust, written as a learning project for both☆10Jun 6, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange☆131Dec 19, 2024Updated last year
- A set of tools for understanding F2FS usage of ZNS devices, which allow for identifying the on-device locations of files and inodes, mapp…☆20Jan 19, 2025Updated last year
- ☆16Apr 10, 2024Updated last year
- Large scale query engine benchmark☆99Apr 5, 2016Updated 9 years ago
- Scripts used to setup a Spark cluster on EC2☆21Mar 24, 2016Updated 10 years ago
- A versioned database inspired by Git☆16Dec 16, 2017Updated 8 years ago
- Framework for running macro benchmarks in a clustered environment☆25Aug 29, 2022Updated 3 years ago
- A research and review of techniques to provide a natural language interface to RDMS.☆10Dec 8, 2017Updated 8 years ago
- Python Repository of the Institute of Astronomy @ KU Leuven☆20Nov 5, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A GUI application for testing GRPC services☆18Nov 20, 2023Updated 2 years ago
- Implementation of time series compression method based on the Facebook Gorilla paper☆13Jan 26, 2026Updated 2 months ago
- A Spark datasource for the HadoopCryptoLedger library☆13Sep 29, 2025Updated 6 months ago
- ZNS Append-only based LSM key-value store☆21Sep 22, 2023Updated 2 years ago
- ☆13Oct 30, 2019Updated 6 years ago
- Fast, reliable, and scalable channels implementation based on Redis streams.☆11Jun 25, 2024Updated last year
- Kiwix Catalog BitTorrent Seeder Companion