bauman / python-bson-streaming
BSON stream raw data into dict or individual BSON format - python
☆37 Updated 3 months ago
Alternatives and similar repositories for python-bson-streaming
Users that are interested in python-bson-streaming are comparing it to the libraries listed below
- PySpark Cassandra brings back the fun in working with Cassandra data in PySpark. ☆79 Updated 8 years ago
- Battle-tested Apache Storm Multi-Lang implementation for Python ☆70 Updated 6 months ago
- Hyper LogLog (native and sliding) cardinality counters ☆241 Updated 5 months ago
- Python Driver for Apache Drill. ☆61 Updated 2 years ago
- Pure Python wrapper for the Hadoop WebHDFS Rest API ☆52 Updated 5 years ago
- Code reference from my Qbox blog posts. ☆87 Updated 10 years ago
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse… ☆90 Updated 10 years ago
- python implementation of the parquet columnar file format. ☆358 Updated 4 years ago
- An Apache Spark-shell backend for IPython ☆105 Updated 4 years ago
- Oracle Data Science Bootcamp 2014 ☆25 Updated 10 years ago
- A pure python HDFS client ☆860 Updated 3 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces. ☆270 Updated last year
- A Python library for dealing with splittable files ☆42 Updated 6 years ago
- A Python MapReduce and HDFS API for Hadoop ☆241 Updated 2 weeks ago
- A wrapper for libhdfs3 to interact with HDFS from Python ☆137 Updated 4 years ago
- python library for interacting with SolrCloud ☆36 Updated 4 years ago
- Utils around luigi. ☆66 Updated 5 months ago
- Docker containers for the IPython notebook (+SciPy Stack) ☆187 Updated 9 years ago
- C++ native client for Impala and Hive, with Python / pandas bindings ☆72 Updated 7 years ago
- ☆16 Updated 10 years ago
- Vagrant project to spin up a cluster of 4 32-bit CentOS6.5 Linux virtual machines with Hadoop v2.6.0 and Spark v1.1.1 ☆124 Updated 10 years ago
- Fast HyperLogLog for Python. ☆110 Updated 5 months ago
- Tools for writing, submitting, debugging, and monitoring Storm topologies in pure Python ☆246 Updated 3 years ago
- Send summary messages of your Luigi jobs to Slack ☆46 Updated 6 years ago
- ☆146 Updated 9 years ago
- SolrClient is a simple python library for Solr; built in python3 with support for latest features of Solr. ☆64 Updated 5 years ago
- Elasticsearch entity resolution plugin based on Duke ☆209 Updated 5 years ago
- A Topic Modeling toolbox ☆92 Updated 9 years ago
- A short guide for transitioning from Python to Scala ☆65 Updated 10 years ago
- Utilities to work with Scala/Java code with py4j ☆40 Updated 2 years ago