pluralsight / spavroLinks
Spavro is a (sp)eedier avro library -- Spavro is a fork of the official Apache AVRO python 2 implementation with the goal of greatly improving data read deserialization and write serialization performance.
☆26Updated last year
Alternatives and similar repositories for spavro
Users that are interested in spavro are comparing it to the libraries listed below
Sorting:
- Fork of aio-libs/aiokafka☆27Updated last year
- Python library for handling efficiently sorted integer sets.☆211Updated 2 weeks ago
- Concurrent appendable key-value storage☆107Updated 11 months ago
- Python library to infer date format from examples☆43Updated 3 years ago
- 💥 Cython bindings for MurmurHash2☆43Updated last month
- Python bindings for simdjson using libpy☆66Updated 2 years ago
- A factory for simplekv-Store-based storage classes.☆24Updated last year
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Updated 9 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 7 years ago
- python implementation of the parquet columnar file format.☆21Updated 3 months ago
- SQL on dataframes - pandas and dask☆64Updated 7 years ago
- Useful Mutable Mappings☆70Updated last year
- Fast HyperLogLog for Python.☆107Updated 6 months ago
- A query and aggregation framework for Bcolz (W2013-01)☆56Updated last year
- Python DataFrame with fast insert and appends☆75Updated 2 months ago
- Pandas Msgpack☆23Updated 2 years ago
- Python bindings for xorfilter(faster and smaller than bloom and cuckoo filters)☆116Updated last month
- A pandas.DataFrame-based ORM.☆85Updated 3 years ago
- fast data loading with binary copy☆118Updated 4 months ago
- ☆20Updated 10 months ago
- A pipeline abstraction for Python☆168Updated 4 years ago
- Caching based on computation time and storage space☆137Updated 4 years ago
- 💥 Cython hash tables that assume keys are pre-hashed☆86Updated last month
- Python library for class-based schema definition, object serialization and data validation☆61Updated 9 years ago
- A consistent table management library in python☆159Updated 2 years ago
- Function dependencies resolution and execution☆70Updated 5 years ago
- persistent caching to memory, disk, or database☆275Updated 3 weeks ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆137Updated 4 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow☆230Updated 2 years ago
- Python module for computing statistics and regression in a single pass.☆99Updated 4 years ago