ifwe / antelope
Antelope Realtime Events framework for feature engineering in agile machine learning environments.
☆26Updated 9 years ago
Alternatives and similar repositories for antelope:
Users that are interested in antelope are comparing it to the libraries listed below
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- scalding powered machine learning☆109Updated 10 years ago
- ReactiveLDA is a fast, lightweight implementation of the Latent Dirichlet Allocation (LDA) algorithm, using a parallel vanilla Gibbs samp…☆61Updated 9 years ago
- Quick informal survey at the Los Angeles Machine learning meetup about tools used for machine learning.☆51Updated 9 years ago
- Distributed Matrix Library☆71Updated 8 years ago
- A collection of efficient utilities for a data scientist.☆41Updated 10 years ago
- Templates for projects based on top of H2O.☆38Updated last month
- Deprecated - Check out MemSQL Pipelines instead!☆8Updated 7 years ago
- Code and Presentation slides for Teaching the Elephant to Read☆17Updated 9 years ago
- Applied Machine Learning in Python with scikit-learn☆47Updated 14 years ago
- An Apache Spark-shell backend for IPython☆105Updated 3 years ago
- Python (PyMC) adaptation of the R code from "Doing Bayesian Data Analysis"☆64Updated 8 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.☆425Updated 9 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- Sparse feature extraction with Spark☆30Updated 6 years ago
- An API for Distributed Machine Learning☆154Updated 8 years ago
- training material☆47Updated 6 months ago
- ☆12Updated 9 years ago
- ☆111Updated 8 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 8 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine☆168Updated 4 years ago
- Fast and efficient batch computation engine for complex analysis and reporting of massive datasets on Hadoop☆243Updated 9 years ago
- A scala-based feature generation and modeling framework☆61Updated 6 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆147Updated 9 years ago
- Machine Learning with Scikit-Learn (material for pydata Amsterdam 2016)☆30Updated 9 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆138Updated 8 years ago
- Dynamic programming inference by continuation hashing.☆29Updated 10 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 10 years ago