pluralsight / spavroLinks
Spavro is a (sp)eedier avro library -- Spavro is a fork of the official Apache AVRO python 2 implementation with the goal of greatly improving data read deserialization and write serialization performance.
☆26Updated 2 years ago
Alternatives and similar repositories for spavro
Users that are interested in spavro are comparing it to the libraries listed below
Sorting:
- Concurrent appendable key-value storage☆107Updated last year
- Fork of aio-libs/aiokafka☆27Updated last year
- Python library for handling efficiently sorted integer sets.☆218Updated 3 months ago
- A consistent table management library in python☆160Updated 2 years ago
- fast data loading with binary copy☆118Updated 6 months ago
- Pandas Msgpack☆24Updated 3 years ago
- Python DataFrame with fast insert and appends☆75Updated last month
- Distributed process pool for Python☆110Updated 3 years ago
- A module for getting data into python from large data sources☆176Updated last year
- Pandas ExtensionDType/Array backed by Apache Arrow☆232Updated 2 years ago
- Function dependencies resolution and execution☆71Updated 5 years ago
- Caching based on computation time and storage space☆137Updated 4 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆137Updated 4 years ago
- SQL on dataframes - pandas and dask☆64Updated 7 years ago
- Serializes data into a JSON format using AVRO schema.☆138Updated 3 years ago
- Easily LISTEN to PostgreSQL NOTIFY messages☆68Updated 4 years ago
- A pandas.DataFrame-based ORM.☆85Updated 3 years ago
- Useful Mutable Mappings☆70Updated last year
- python implementation of the parquet columnar file format.☆21Updated last week
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 7 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Updated 9 years ago
- Deploy dask on YARN clusters☆69Updated last year
- Python bindings for FarmHash and CityHash☆45Updated 2 months ago
- A pipeline abstraction for Python☆168Updated 4 years ago
- IP Address dtype and block for pandas☆105Updated 2 years ago
- Undermining Python's "turtles-all-the-way-up" asynchronous idiom.☆19Updated 6 years ago
- Embedded MonetDB with a Python frontend and fast Numpy/Pandas support☆63Updated last year
- persistent caching to memory, disk, or database☆275Updated 3 months ago
- ☆21Updated 3 weeks ago
- Derivatives models written with the Tributary data flow library☆24Updated this week