☆40Feb 1, 2017Updated 9 years ago
Alternatives and similar repositories for cuckoofilter
Users that are interested in cuckoofilter are comparing it to the libraries listed below
Sorting:
- Benchmarks of artificial neural network library for Spark MLlib☆11Dec 3, 2015Updated 10 years ago
- Natural language hashing library.☆10Nov 24, 2014Updated 11 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Jun 1, 2015Updated 10 years ago
- Data science repo to help others☆12Feb 10, 2016Updated 10 years ago
- Simple spill-to-disk dictionary☆18May 24, 2016Updated 9 years ago
- Machine learning evaluation database☆24Feb 7, 2018Updated 8 years ago
- Twitter Bot to perform advanced search and automated response☆13Dec 22, 2017Updated 8 years ago
- Omnivore Optimizer and Distributed CcT☆13Jun 17, 2016Updated 9 years ago
- Dynamic Numpy arrays☆13Feb 26, 2017Updated 9 years ago
- GRPC-like RPC library that supports file descriptor passing by using Argdata☆19Jan 14, 2019Updated 7 years ago
- A collection of Scala graph libraries and adapters for graph databases.☆15Jan 31, 2017Updated 9 years ago
- A skip dict is a Python dictionary which is permanently sorted by value.☆19Sep 25, 2014Updated 11 years ago
- Parallel Iterative Algorithm (SGD) on Hadoop's YARN framework☆42Jan 30, 2013Updated 13 years ago
- A backport of the `yield from` semantic from Python 3.x to Python 2.7☆20Dec 17, 2019Updated 6 years ago
- Dask and Spark interactions☆21Mar 13, 2017Updated 8 years ago
- ☆35Jan 17, 2015Updated 11 years ago
- SparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities.☆46Jul 8, 2018Updated 7 years ago
- ☆20Jun 26, 2017Updated 8 years ago
- Python implementation of nonparametric nearest-neighbor-based estimators for divergences between distributions.☆48Mar 13, 2017Updated 8 years ago
- Convert LaTeX beamer files to Jupyter/IPython notebooks and RISE☆25Aug 28, 2025Updated 6 months ago
- A colony of interacting processes☆23Jun 17, 2024Updated last year
- Reduce your data. A unix filter for algebird-powered aggregation.☆141Apr 17, 2017Updated 8 years ago
- A Neural network implementation with Scala☆20Jul 17, 2016Updated 9 years ago
- Distributed Streaming Quantiles (for PySpark)☆38Jan 30, 2014Updated 12 years ago
- stochs: fast stochastic solvers for machine learning in C++ and Cython☆26Oct 13, 2022Updated 3 years ago
- Joins for skewed datasets in Spark☆57Aug 18, 2017Updated 8 years ago
- Implements a Min-Heap / Priority Queue in C using an indirection table for memory efficiency.☆29Aug 26, 2014Updated 11 years ago
- implementations of a counting bloom, a timing bloom and a scaling timing bloom... made for working with streams!☆42Feb 1, 2017Updated 9 years ago
- It is a forest of random projection trees☆225Feb 8, 2020Updated 6 years ago
- Bayesian supervised learning in Python☆21Oct 19, 2015Updated 10 years ago
- FRED simulator and associated paper☆26Jan 15, 2016Updated 10 years ago
- Module for supporting writing in a single source file a python module and a corresponding cython module. Contrary to cython pure python m…☆26Jun 7, 2016Updated 9 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Nov 25, 2017Updated 8 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- Beating the benchmark for Microsoft Malware Classification Challenge (BIG 2015)☆28Feb 17, 2015Updated 11 years ago
- ☆10Apr 20, 2022Updated 3 years ago
- A short paper describing the library is available on arXiv.☆64Jan 5, 2018Updated 8 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Jul 17, 2015Updated 10 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Aug 8, 2016Updated 9 years ago