jeromebanks / brickhouse
Hive UDFs for the data warehouse
☆19 · May 7, 2018 · Updated 7 years ago
Alternatives and similar repositories for brickhouse
Users that are interested in brickhouse are comparing it to the libraries listed below
- Scala for the Impatient (2nd edition) - My Solutions ☆10 · Dec 22, 2017 · Updated 8 years ago
- Toy Hadoop cluster combining various SQL-on-Hadoop variants ☆13 · Nov 16, 2017 · Updated 8 years ago
- Naive Bayesian, kNN Java demo ☆14 · Aug 29, 2013 · Updated 12 years ago
- Web-based application for storing geodata ☆11 · Sep 21, 2021 · Updated 4 years ago
- Scala Kittens, some useful classes, some experimental code ☆57 · Jan 22, 2024 · Updated 2 years ago
- Distributed Factorization Machines and LR with ps-lite ☆10 · Sep 27, 2017 · Updated 8 years ago
- A Java implementation of LIBFFM: A Library for Field-aware Factorization Machines ☆10 · Jan 4, 2022 · Updated 4 years ago
- WSGI (Python, Flask) service for ArcGIS Feature Layer REST replacement ☆14 · Mar 19, 2014 · Updated 11 years ago
- ☆12 · Jan 4, 2020 · Updated 6 years ago
- Merge small files for Hive tables on HDFS ☆15 · Mar 4, 2014 · Updated 11 years ago
- Spring 5 source code with Chinese annotations ☆13 · Oct 13, 2019 · Updated 6 years ago
- This is python implementation of Programming Exercises for ☆13 · Mar 4, 2016 · Updated 9 years ago
- ☆12 · Oct 8, 2021 · Updated 4 years ago
- Keyword extraction ☆18 · Dec 15, 2015 · Updated 10 years ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆13 · Jul 27, 2024 · Updated last year
- ☆28 · Dec 5, 2025 · Updated 2 months ago
- Randomized SVD of large sparse matrices on Spark ☆77 · Jul 21, 2022 · Updated 3 years ago
- ETL processing toolset with SQL-like language and GIS capabilities, built on core Spark. Extensible and modular. REPL included ☆16 · Jan 26, 2026 · Updated 3 weeks ago
- Distributed implementation of DeepWalk on Apache Spark ☆14 · Apr 12, 2018 · Updated 7 years ago
- Traversal-based router for aiohttp.web ☆21 · Oct 31, 2018 · Updated 7 years ago
- Contains examples, notes, and projects on Spark ☆18 · Aug 1, 2021 · Updated 4 years ago
- Helpful tools for monitoring Kafka Connect ☆20 · Feb 20, 2018 · Updated 7 years ago
- This repo contains the Vagrant file and configurations for setting up a three-node (client, master, and data node) Elasticsearch (2.2.0 vers… ☆18 · Feb 4, 2016 · Updated 10 years ago
- Export data from DWG files via the AutoCAD ActiveX API ☆19 · Sep 17, 2014 · Updated 11 years ago
- DEPRECATED! Use https://github.com/h2oai/sparkling-water repository! H2O and Spark interoperability based on Tachyon. ☆44 · Nov 25, 2014 · Updated 11 years ago
- An ORM that wraps HBase operations: easy-hbase, for more convenient use of HBase ☆20 · Jan 16, 2024 · Updated 2 years ago
- A table schema-less OLAP Analytics Engine for Big Data. ☆24 · Apr 23, 2024 · Updated last year
- Notes taken while writing Spark or Scala ☆19 · Sep 29, 2017 · Updated 8 years ago
- ☆21 · Feb 5, 2025 · Updated last year
- Live Photos via Web Components ☆22 · Aug 11, 2019 · Updated 6 years ago
- A report (chart) platform for enterprise data. ☆21 · Nov 18, 2019 · Updated 6 years ago
- Item-based collaborative filtering in Apache Spark ☆21 · Aug 26, 2016 · Updated 9 years ago
- Spark Streaming real-time stream processing: a hands-on project ☆18 · Jul 12, 2025 · Updated 7 months ago
- A collection of examples illustrating data processing, data science, and machine learning on the Pivotal Greenplum and HAWQ MPP databases ☆20 · Apr 26, 2016 · Updated 9 years ago
- Augustus is an open source system for building and scoring statistical models designed to work with data sets that are too large to fit i… ☆43 · Dec 19, 2013 · Updated 12 years ago
- A summary of the mainstream public datasets in the OCR field, covering detection and recognition, various scenarios, and various languages, with related information and download links for each dataset. ☆32 · Aug 21, 2022 · Updated 3 years ago
- Mnj (Mongo Energy) is a helper library to simplify PyMongo interaction ☆26 · Nov 19, 2024 · Updated last year
- Java 8 Factorization Machines Library ☆28 · Feb 17, 2017 · Updated 8 years ago
- HanLP Chinese Analysis Plugin for Elasticsearch http://www.elasticsearch.org ☆19 · Aug 10, 2016 · Updated 9 years ago