intenthq/pucket

Bucketing and partitioning system for Parquet

☆30

Alternatives and similar repositories for pucket

Users that are interested in pucket are comparing it to the libraries listed below.

intenthq / gitkv
gitkv is a server for using git as a key value store for text files
☆14 · Jul 17, 2023 · Updated 3 years ago
intenthq / gander
Html Content / Article Extractor in Scala
☆18 · May 23, 2018 · Updated 8 years ago
ontodev / howl
HOWL: Humane OWL Format
☆16 · Mar 17, 2017 · Updated 9 years ago
meadsteve / british_food_generator
🥧 Generates classic British dishes
☆13 · Dec 27, 2022 · Updated 3 years ago
pippokill / tri
Temporal Random Indexing
☆14 · Oct 3, 2024 · Updated last year
adrianulbona / borders
☆17 · Jan 25, 2017 · Updated 9 years ago
jshook / perfscripts
scripts to quickly measure system baseline performance
☆23 · Aug 1, 2025 · Updated 11 months ago
kasei / diomede
LMDB-based RDF Quadstore implemented in Swift
☆14 · Oct 29, 2024 · Updated last year
abajwa-hw / nifi-network-processor
Sample custom Nifi processor to process tcpdump
☆18 · Nov 19, 2015 · Updated 10 years ago
degupta / human_readable_json_protocol
A way to convert Thrift Services and Functions into Human Readable JSON
☆18 · Feb 4, 2018 · Updated 8 years ago
implydata / druid-hadoop-inputformat
Hadoop InputFormat for http://druid.io/
☆10 · Oct 26, 2016 · Updated 9 years ago
mstewartgallus / prologish
☆12 · Sep 22, 2020 · Updated 5 years ago
cerndb / Hadoop-Profiler
Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.
☆24 · Jul 7, 2016 · Updated 10 years ago
tresata / spark-skewjoin
Joins for skewed datasets in Spark
☆58 · Aug 18, 2017 · Updated 8 years ago
yashbonde / transformer_network_tensorflow
Tensorflow implementation of transformer network from "Attention is all you need" Paper. Also use cases of it!
☆16 · Jan 23, 2020 · Updated 6 years ago
datasalt / splout-db
A web-latency SQL spout for Hadoop.
☆51 · Jan 25, 2021 · Updated 5 years ago
datadotworld / ckanext-datadotworld
CKAN extension for data.world
☆12 · Dec 5, 2023 · Updated 2 years ago
busesese / DeepFM_Keras
a model of deepfm using keras
☆12 · Apr 2, 2019 · Updated 7 years ago
liquidm / druid-dumbo
☆21 · Mar 17, 2023 · Updated 3 years ago
solr-cool / solr-cool.github.io
The Solr Package Directory and Sanctuary
☆13 · May 28, 2026 · Updated last month
jeroenvandijk / cascalog-graph
Graph implementation for Cascalog
☆26 · May 11, 2014 · Updated 12 years ago
metamx / scala-util
Scala stuff
☆18 · Jun 13, 2019 · Updated 7 years ago
zouzias / spark-lucenerdd-examples
Examples of spark-lucenerdd
☆15 · Oct 6, 2023 · Updated 2 years ago
sematext / solr-researcher
Solr SearchComponent for altering and re-executing queries that produce poor results
☆13 · May 12, 2021 · Updated 5 years ago
grossws / solr-dvtf
Apache Solr TextField with docValues support
☆11 · Mar 24, 2022 · Updated 4 years ago
s22s / pre-lt-raster-frames
Spark DataFrames for earth observation data
☆20 · May 1, 2018 · Updated 8 years ago
robertjneal / rl
Scala reinforcement learning framework
☆15 · Feb 11, 2022 · Updated 4 years ago
mladvladimir / sparqlom
SPARQL query builder and DSL
☆10 · Mar 6, 2018 · Updated 8 years ago
auditNG / auditNG
☆11 · Mar 9, 2018 · Updated 8 years ago
volkanaktas / TurtaRoleKontrol
A program written in Python that controls the Raspberry Pi Turta relay board through a graphical interface
☆10 · Nov 30, 2016 · Updated 9 years ago
nexacenter / public-contracts
☆10 · Apr 20, 2016 · Updated 10 years ago
webr3 / rdf.js
RDF Tooling for ECMAScript V5 and Javascript
☆26 · Aug 5, 2011 · Updated 14 years ago
aws-samples / aws-elastic-volumes
Sample code to help with Elastic Block Store automation with Elastic Volumes feature
☆12 · Feb 24, 2017 · Updated 9 years ago
javieu / binanceBot
A pretty simple bot for crypto alt coins for Binance, using TA4j strategies and indicators
☆14 · Dec 2, 2022 · Updated 3 years ago
GeoKnow / Jassa-Core
The Jassa Core library consists of RDF core classes, SPARQL syntax classes and SPARQL service classes
☆28 · Jun 22, 2022 · Updated 4 years ago
lightcopy / parquet-index
Spark SQL index for Parquet tables
☆134 · May 6, 2021 · Updated 5 years ago
bherrmann7 / table-explorer
Allows a user to explore a database's tables visually
☆13 · Oct 29, 2024 · Updated last year
Steven-N-Hart / VariantDB_Challenge
Finding a scalable alternative to the VCF File for genomics analysis
☆14 · Jan 5, 2017 · Updated 9 years ago
julianlam / nodebb-plugin-solr
Full-text searching for NodeBB using Apache Solr
☆22 · Aug 30, 2024 · Updated last year