jdwittenauer / hadoop-trainingLinks
Hadoop training material from free MapR courses.
☆54Updated 8 years ago
Alternatives and similar repositories for hadoop-training
Users that are interested in hadoop-training are comparing it to the libraries listed below
Sorting:
- An analysis of historical Hacker News data to determine the ranking algorithm☆85Updated 8 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- An unofficial Microsoft Machine Learning Server Docker image.☆143Updated 7 years ago
- A workshop for scientific computing in Python. ( December 2017 )☆378Updated 7 years ago
- Is there a relationship between popularity of a given technology on Stack Overflow (SO) and Hacker News (HN)? And a few words about causa…☆96Updated 6 years ago
- Hacker News plus topic tags. TechCrunch Disrupt NY Hackathon 2017☆123Updated 6 years ago
- Various math-related things in Python code☆160Updated 4 years ago
- Chaotic Life☆44Updated 7 years ago
- A collection of descriptions of the architecture that various systems use.☆24Updated 6 years ago
- Using word vectors to classify spam messages☆150Updated 7 years ago
- Content for architecting a data science platform for products using Luigi, Spark & Flask.☆163Updated 5 years ago
- Spark Application : Spark Summit 2018 : Streaming Trend Discovery☆11Updated 6 years ago
- Classic Hacker News stories☆144Updated 5 years ago
- BloomFilter in python☆101Updated 7 years ago
- Index & Search Hacker News using Elasticsearch and the HN API☆96Updated 7 years ago
- Get started with Gitlab in practicable time☆134Updated 5 years ago
- Links to excellent videos, articles, blogs, etc. on microservices architecture☆82Updated 6 years ago
- A highly configurable Google Cloud Dataflow pipeline that writes data into Google Big Query table from Pub/Sub☆67Updated 7 years ago
- Fundamental algorithms☆93Updated 6 years ago
- A general-purpose data analysis engine radically changing the way batch and stream data is processed☆7Updated 6 years ago
- A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support☆261Updated 7 years ago
- ☆92Updated 8 years ago
- Develop and run your Python applications in clean Docker environments☆351Updated 5 years ago
- Introduction to common Probabilistic Algorithms: Approximate Counting, Flajolet-Martin, LogLog, HyperLogLog, Bloom Filters☆60Updated 8 years ago
- At Twitter I often asked a simple question, render a tweet given the text and an unordered list of its entities☆42Updated 3 years ago
- A primer for data science tools in Python☆56Updated 5 years ago
- Curated list of all dataset websites that I find☆84Updated 6 years ago
- ☆13Updated 9 years ago
- Googol Game or "You should learn when to quit". A JavaScript game.☆49Updated 5 years ago
- Directory of Jupyter notebooks exploring various topics☆316Updated 8 years ago