jdwittenauer / hadoop-training
Hadoop training material from free MapR courses.
☆54Updated 7 years ago
Alternatives and similar repositories for hadoop-training:
Users that are interested in hadoop-training are comparing it to the libraries listed below
- An unofficial Microsoft Machine Learning Server Docker image.☆143Updated 7 years ago
- A collection of descriptions of the architecture that various systems use.☆24Updated 6 years ago
- Hacker News plus topic tags. TechCrunch Disrupt NY Hackathon 2017☆123Updated 6 years ago
- Is there a relationship between popularity of a given technology on Stack Overflow (SO) and Hacker News (HN)? And a few words about causa…☆97Updated 6 years ago
- Chaotic Life☆45Updated 7 years ago
- Various math-related things in Python code☆159Updated 3 years ago
- Get started with Gitlab in practicable time☆134Updated 5 years ago
- Topic Modeling over Paul Graham's essays☆12Updated 6 years ago
- Content for architecting a data science platform for products using Luigi, Spark & Flask.☆163Updated 5 years ago
- Using word vectors to classify spam messages☆150Updated 7 years ago
- Geo-Located Data: Extracting Patterns from Mobile Data using Scikit-Learn and Cassandra☆30Updated 6 years ago
- An analysis of historical Hacker News data to determine the ranking algorithm☆85Updated 7 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 3 years ago
- RESEARCH [NLP ] This is an implementation of "Automatic Consensus-Based Text Summarizer" along with text-organizing capabilities that ca…☆98Updated 7 years ago
- A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support☆262Updated 7 years ago
- BloomFilter in python☆102Updated 7 years ago
- exploring how to design, build and maintaining sane data pipelines☆22Updated 7 years ago
- A module which fairly distributes a list of arbitrary objects among a set of targets, considering weights.☆77Updated 7 years ago
- Sharing interesting and noteworthy Data Engineering content☆66Updated 8 years ago
- Java INtegrated Query in parlance with LINQ is an ultra minimalistic library for Java inspired from and mimicking the .NET LINQ. While LI…☆85Updated 3 years ago
- A workshop for scientific computing in Python. ( December 2017 )☆378Updated 6 years ago
- A primer for data science tools in Python☆56Updated 5 years ago
- An event bus framework for event driven programming☆71Updated 2 years ago
- A tool to manage GitHub repo collaborators with files☆79Updated 6 years ago
- Interactive K-Nearest Neighbors machine learning algorithm in JavaScript.☆82Updated 4 years ago
- Summaries of papers that I've read☆9Updated 9 years ago
- Quick implementations of some advanced algorithms for searching, sorting and trees☆77Updated 5 years ago
- Course materials for Research Software Engineering course.☆45Updated 5 years ago
- Googol Game or "You should learn when to quit". A JavaScript game.☆48Updated 5 years ago
- Data exploration of software developer salary info from Hacker News☆41Updated 8 years ago