jdwittenauer / hadoop-training
Hadoop training material from free MapR courses.
☆53 Updated 8 years ago
Alternatives and similar repositories for hadoop-training:
Users that are interested in hadoop-training are comparing it to the libraries listed below
- An unofficial Microsoft Machine Learning Server Docker image. ☆143 Updated 7 years ago
- Is there a relationship between popularity of a given technology on Stack Overflow (SO) and Hacker News (HN)? And a few words about causa… ☆96 Updated 6 years ago
- Get started with Gitlab in practicable time ☆134 Updated 5 years ago
- Hacker News plus topic tags. TechCrunch Disrupt NY Hackathon 2017 ☆123 Updated 6 years ago
- A collection of descriptions of the architecture that various systems use. ☆24 Updated 6 years ago
- Using word vectors to classify spam messages ☆150 Updated 7 years ago
- An analysis of historical Hacker News data to determine the ranking algorithm ☆85 Updated 8 years ago
- Exploring how to design, build, and maintain sane data pipelines ☆22 Updated 7 years ago
- BloomFilter in python ☆101 Updated 7 years ago
- A workshop for scientific computing in Python. ( December 2017 ) ☆378 Updated 7 years ago
- Chaotic Life ☆44 Updated 7 years ago
- Fundamental algorithms ☆93 Updated 5 years ago
- Content for architecting a data science platform for products using Luigi, Spark & Flask. ☆163 Updated 5 years ago
- GitHub bot for improving your project's PR review workflow ☆116 Updated last month
- Pragmatic & Practical Bayesian Sentiment Classifier ☆219 Updated 8 years ago
- Spark Application : Spark Summit 2018 : Streaming Trend Discovery ☆11 Updated 6 years ago
- Curated list of all dataset websites that I find ☆84 Updated 6 years ago
- Various math-related things in Python code ☆159 Updated 4 years ago
- Code from my guides explaining various algorithms and data structures ☆73 Updated 6 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc. ☆58 Updated 4 years ago
- Interactive K-Nearest Neighbors machine learning algorithm in JavaScript. ☆82 Updated 4 years ago
- A module which fairly distributes a list of arbitrary objects among a set of targets, considering weights. ☆77 Updated 7 years ago
- Calculates marginal tax for consultants, in both directions (net <-> gross). ☆84 Updated 2 years ago
- Sharing interesting and noteworthy Data Engineering content ☆67 Updated 8 years ago
- Blockchain fabric code ☆41 Updated 9 years ago
- Googol Game or "You should learn when to quit". A JavaScript game. ☆49 Updated 5 years ago
- Implementations of mathematical functions, formulas and concepts ☆93 Updated 5 years ago
- Index & Search Hacker News using Elasticsearch and the HN API ☆96 Updated 7 years ago
- A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support ☆261 Updated 7 years ago
- A highly configurable Google Cloud Dataflow pipeline that writes data into Google Big Query table from Pub/Sub ☆67 Updated 6 years ago