max-webster/get-started-impala

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/max-webster/get-started-impala)

max-webster / get-started-impala

This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media)

☆22

Alternatives and similar repositories for get-started-impala

Users that are interested in get-started-impala are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yeleid / eagleeye
View on GitHub
An app built on Cloudera Enterprise for tracking metrics of jobs that run in YARN framework
☆13Feb 5, 2016Updated 10 years ago
lintool / SparkTutorial
View on GitHub
Spark Tutorial at the University of Maryland
☆37Oct 24, 2014Updated 11 years ago
quartethealth / spark-fixedwidth
View on GitHub
Fixed-width data source for Spark SQL and DataFrames
☆10Oct 25, 2016Updated 9 years ago
intel / open-network-insight
View on GitHub
This site has moved to the ONI organization at https://github.com/Open-Network-Insight
☆14Apr 5, 2016Updated 10 years ago
tmalaska / HBase-ToHDFS
View on GitHub
Reads a HBase table and writes the out as Text, Seq, Avro, or Parquet
☆28May 15, 2014Updated 12 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cloudera / impala-tpcds-kit
View on GitHub
TPC-DS Kit for Impala
☆170May 20, 2024Updated 2 years ago
kj-ki / tpc-h-impala
View on GitHub
TPC-H Benchmark on Cloudera Impala
☆19Apr 25, 2013Updated 13 years ago
Azure-Samples / event-hubs-dotnet-ingest
View on GitHub
Shows how to send data to an event hub, in C#.
☆13Jan 9, 2024Updated 2 years ago
bfeif / polars-for-data-science-oreilly-course
View on GitHub
☆13Jul 1, 2025Updated last year
influxdata / parquet-bloom-filter-analysis
View on GitHub
Generate Parquet Files
☆14Apr 23, 2026Updated 3 months ago
albertoRamon / Kylin
View on GitHub
See Apache Kylin Website for a complete description
☆30May 28, 2018Updated 8 years ago
ArroyoSystems / streamgen
View on GitHub
Mock streaming data generator
☆18May 31, 2024Updated 2 years ago
rustyrazorblade / cdm
View on GitHub
Cassandra Dataset Manager
☆14Sep 1, 2017Updated 8 years ago
teamclairvoyant / hadoop-deployment-bash
View on GitHub
Code for the deployment of Hadoop clusters, written in Bourne or Bourne Again shell.
☆33Nov 28, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
cloudera / hs2client
View on GitHub
C++ native client for Impala and Hive, with Python / pandas bindings
☆72Aug 15, 2018Updated 7 years ago
GoogleCloudPlatform / datacatalog-tag-history
View on GitHub
Historical metadata of your data warehouse is a treasure trove to discover not just insights about changing data patterns, but also quali…
☆13Jul 21, 2021Updated 5 years ago
wsuen / PyGotham_Spark_Streaming_demo
View on GitHub
PyGotham 2017: Spark Streaming for World Domination (and other projects)
☆10Oct 5, 2017Updated 8 years ago
sschatts / conference_talks
View on GitHub
☆13Oct 5, 2019Updated 6 years ago
jgperrin / net.jgp.books.spark.ch16
View on GitHub
Spark in Action, 2nd edition - chapter 16 - performance, checkpointing, and caching
☆12Apr 21, 2023Updated 3 years ago
tfayyaz / cloud-dataproc
View on GitHub
Cloud Dataproc: Samples and Utils
☆11Sep 23, 2020Updated 5 years ago
jamescasbon / vertica-sqlalchemy
View on GitHub
vertica dialect for sqlalchemy
☆12Aug 25, 2015Updated 10 years ago
dhutchis / LaraDB
View on GitHub
A platform for unified linear and relational algebra analytics, built on the Accumulo NoSQL database
☆13Feb 9, 2022Updated 4 years ago
tmalaska / SparkUnitTestingExamples
View on GitHub
This project is a collection of Spark Unit Tests Examples to help new Spark users have good examples on how to unit start their code for …
☆35Sep 30, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
GoogleCloudPlatform / bigquery-notebooks
View on GitHub
☆19Aug 23, 2022Updated 3 years ago
vangj / vagrant-hadoop-2.3.0
View on GitHub
A project that creates a VM with single node setup of Hadoop v2.3.0 with YARN installed.
☆19Apr 16, 2014Updated 12 years ago
jgperrin / net.jgp.books.spark.ch15
View on GitHub
Spark in Action, 2nd edition - chapter 15 - Aggregating your data
☆12Sep 8, 2022Updated 3 years ago
omribahumi / python_tornado_thrift
View on GitHub
Using Python Tornado to serve Thrift HTTP requests
☆13Dec 22, 2012Updated 13 years ago
jgperrin / net.jgp.books.spark.ch17
View on GitHub
Spark in Action, 2nd edition - chapter 16 - exporting data, using delta lake
☆14Apr 21, 2023Updated 3 years ago
ConsenSys-archive / polygon-box
View on GitHub
Boilerplate code for deploying contracts to the Polygon Matic PoS network.
☆39Dec 2, 2021Updated 4 years ago
BigtoC / FUTU_Stop_Loss
View on GitHub
A stock stop loss python program using FUTU API
☆12Feb 18, 2019Updated 7 years ago
0312birdzhang / Zabbix4j
View on GitHub
Use https://github.com/tinawenqiao/zabbix3api please
☆12Nov 10, 2017Updated 8 years ago
apiacademy / ansible-consul-demo
View on GitHub
Demo of Consul & Ansible @ AnsibleFest NY, 2015. Accompanying slide deck:
☆14Oct 27, 2015Updated 10 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jgperrin / net.jgp.books.spark.ch13
View on GitHub
Spark in Action, 2nd edition - chapter 13 - Transforming documents
☆14Apr 21, 2023Updated 3 years ago
google / dataflow-ml-starter
View on GitHub
☆24Feb 16, 2026Updated 5 months ago
CODAIT / spark-db2
View on GitHub
DB2/DashDB Connector for Apache Spark
☆14Jul 30, 2021Updated 4 years ago
jgperrin / net.jgp.books.spark.ch11
View on GitHub
Spark in Action, 2nd edition - chapter 11 - Working with SQL
☆15Apr 21, 2023Updated 3 years ago
evansendra / sublime-text-icon
View on GitHub
A replacement icon for Sublime Text 2 and Sublime Text 3
☆15Oct 2, 2015Updated 10 years ago
helena / spark-cassandra
View on GitHub
An Akka Extension for easy integration of spark and cassandra in Akka micro services.
☆24Sep 25, 2014Updated 11 years ago
jgperrin / net.jgp.books.spark.ch08
View on GitHub
Spark in Action, 2nd edition - chapter 8
☆18Apr 21, 2023Updated 3 years ago