Data set and queries that I use in my Hive and Impala presentations. Slides are usually posted at slideshare.net/markgrover
☆20May 19, 2014Updated 12 years ago
Alternatives and similar repositories for cloudcon-hive
Users that are interested in cloudcon-hive are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- In-kernel RDMA library☆13Nov 7, 2023Updated 2 years ago
- Stacking a block device over another block device☆17Oct 29, 2014Updated 11 years ago
- Unix环境高级编程学习笔记☆13Jul 20, 2014Updated 11 years ago
- CoRM: Compactable Remote Memory over RDMA☆20Jun 18, 2021Updated 4 years ago
- Labs and data files for a full-day Spark workshop☆25May 24, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- On Stacking a Persistent Memory File System on Legacy File Systems [FAST '23]☆18May 18, 2023Updated 3 years ago
- A Notebook on building a Suicide Ideation Classifier using Natural Language Processing(NLP)☆11Aug 17, 2020Updated 5 years ago
- 腾讯优图开放平台 c++ sdk☆15Apr 10, 2019Updated 7 years ago
- 项目中保留了向开源社区提交 过的patch☆16Oct 22, 2017Updated 8 years ago
- DHT-based Distributed File System for MapReduce Jobs☆26Mar 6, 2026Updated 3 months ago
- Swarm64 DA Benchmark Toolkit☆32Dec 22, 2025Updated 5 months ago
- Welcome to the RAG University repository! This repository contains code implementations for Retrieval-Augmented Generation (RAG) models, …☆23Dec 18, 2023Updated 2 years ago
- This repo contains commands that data engineers use in day to day work.☆61Feb 4, 2023Updated 3 years ago
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A…☆41Jul 6, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- React rendering for Meteor apps☆12Mar 4, 2015Updated 11 years ago
- ☆54Feb 1, 2026Updated 4 months ago
- A high-performance and easy-to-use HTTP service framework.☆32Nov 24, 2019Updated 6 years ago
- ListDB: Union of Write-Ahead Logs and Persistent SkipLists for Incremental Checkpointing on Persistent Memory☆50Jul 18, 2024Updated last year
- All Data Engineering notebooks from Datacamp course☆116Dec 11, 2019Updated 6 years ago
- Serverless Architecture with AWS☆10Apr 15, 2016Updated 10 years ago
- Fundamentals of Spark with Python (using PySpark), code examples☆364Oct 29, 2022Updated 3 years ago
- Explore external scalers built by the community.☆12Mar 23, 2026Updated 2 months ago
- Merkle tree and other data structures.☆17Dec 30, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code review checklist with examples (still WIP).☆15Jul 30, 2022Updated 3 years ago
- implement similar functionalities using different java concurrency utilities and compare the performance☆13Dec 7, 2016Updated 9 years ago
- Problems from algo expert solved in Java☆12Jan 16, 2020Updated 6 years ago
- Automation of desktop, web, mainframe and citrix based processes using RPA tools such as BluePrism, PegaRobotics, Automaton Anywhere and …☆11Dec 9, 2017Updated 8 years ago
- Dev Ops Dashboard for Petabyte Scale AI Data Lake☆12Mar 28, 2023Updated 3 years ago
- 🔑 A service which provides continuous user authentication to web applications, using keystroke dynamics.☆13Oct 24, 2018Updated 7 years ago
- Apache RocketMQ lite cpp client☆11May 15, 2026Updated 3 weeks ago
- A personal homepage to start coding☆14Aug 1, 2019Updated 6 years ago
- Machine Learning Workshop Resources☆12Feb 16, 2019Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- As customers move from building data lakes and analytics on AWS to building machine learning solutions, one of their biggest challenges i…☆63Nov 28, 2018Updated 7 years ago
- ☆11Mar 15, 2017Updated 9 years ago
- React Starter Kit — a skeleton of a simple web application built with React.js, JSX, ES6+, Babel, PostCSS, ReactHotLoader, and Webpack.☆14Jun 22, 2016Updated 9 years ago
- SoftUni course CSharp OOP Advanced: All tasks with their solutions.☆10Aug 14, 2020Updated 5 years ago
- Python design patterns (https://app.pluralsight.com/library/courses/python-design-patterns)☆13Mar 11, 2018Updated 8 years ago
- [Archived] A Fast Multi-tiered Distributed Storage System based on User-Level I/O☆75Mar 2, 2018Updated 8 years ago
- Interactive HTML canvas based implementation of k-means.☆16Mar 24, 2018Updated 8 years ago