A fast-to-develop, fast-to-run, Go-based toolkit for ETL and feature extraction on Hadoop.
☆212 · Nov 19, 2014 · Updated 11 years ago
Alternatives and similar repositories for crunch
Users who are interested in crunch are comparing it to the libraries listed below.
- Coordinate job queue system, Go implementation ☆29 · Apr 10, 2025 · Updated 10 months ago
- Node.js ETL (Extract, Transform, Load) toolkit for easy data import, export or transfer between systems. ☆295 · Jul 29, 2018 · Updated 7 years ago
- ☆11 · Jun 14, 2015 · Updated 10 years ago
- Simple Redis Go recommendation engine ☆61 · Sep 8, 2016 · Updated 9 years ago
- Built to allow easy Quartz setup within ServiceStack. ☆12 · Jan 6, 2024 · Updated 2 years ago
- libsvm go version ☆72 · May 9, 2016 · Updated 9 years ago
- Docker image as a tsuru service. ☆11 · Oct 22, 2019 · Updated 6 years ago
- ☆13 · Jan 15, 2017 · Updated 9 years ago
- An n-gram indexing implementation in Go ☆12 · Oct 13, 2022 · Updated 3 years ago
- Distributed SQL-based real-time streaming computation framework on Apache Storm and Spark ☆12 · Mar 14, 2016 · Updated 9 years ago
- A sentry (raven-go) middleware for echo micro web framework ☆16 · May 5, 2018 · Updated 7 years ago
- Add Let's Encrypt (ACME) support to generate and renew SSL certificates to go servers using the DNS provider challenge. ☆12 · Jun 20, 2016 · Updated 9 years ago
- The fiber-based proxy for the micro services. ☆11 · Jan 27, 2015 · Updated 11 years ago
- One large file containing a billion small files ☆14 · Mar 7, 2014 · Updated 12 years ago
- A cron library for Go that uses Redis to ensure a given job runs on only one instance when multiple instances are deployed. ☆12 · Jan 6, 2025 · Updated last year
- Trident State implementation on top of Elasticsearch ☆21 · May 18, 2015 · Updated 10 years ago
- A chatbot for food lovers, built with Google Gemini-Pro ☆17 · Jul 31, 2024 · Updated last year
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover… ☆13 · May 3, 2019 · Updated 6 years ago
- Go client library for SeatGeek's Sixpack AB testing framework. ☆20 · Sep 2, 2013 · Updated 12 years ago
- Redis search and indexing in Java ☆16 · Sep 26, 2016 · Updated 9 years ago
- Data-Centric Pipelines and Data Versioning ☆6,288 · Feb 3, 2025 · Updated last year
- webterm is a simple web-based terminal built with golang and javascript ☆20 · Aug 10, 2015 · Updated 10 years ago
- Workflow engine for various computing systems. ☆26 · Mar 30, 2017 · Updated 8 years ago
- A simple implementation of Bloom Filter and Scalable Bloom Filter for Python 3. ☆17 · Aug 11, 2018 · Updated 7 years ago
- Sync data between persistence engines, like ETL only not stodgy ☆1,447 · Oct 17, 2023 · Updated 2 years ago
- WIP : this is a playground, code is messy, exploring various ideas... ☆16 · Jul 1, 2017 · Updated 8 years ago
- A communication bus with a strict request/response interface, aiming to make unit testing simpler, by treating side effects as data. ☆18 · Mar 1, 2023 · Updated 3 years ago
- A declarative, SQL-like DSL for data integration tasks. ☆14 · Jul 4, 2018 · Updated 7 years ago
- This project is a unified ETL platform that supports various data processing technologies, including Spark, Hive, Hadoop, Python, Linux Sh… ☆17 · Oct 16, 2015 · Updated 10 years ago
- Go library for writing standalone Map/Reduce jobs or for use with Hadoop's streaming protocol ☆104 · Mar 6, 2014 · Updated 12 years ago
- A listing of gophers ☆19 · Apr 19, 2015 · Updated 10 years ago
- Concord Go(lang) client ☆21 · Mar 22, 2016 · Updated 9 years ago
- Slack bot written in Go ☆24 · Jan 6, 2020 · Updated 6 years ago
- A fault-tolerant, distributed cluster of Redis servers with built-in load-balancing and fall-backs to provide data availability ☆22 · Feb 18, 2022 · Updated 4 years ago
- Elixir Toggl API Wrapper ☆18 · Nov 23, 2017 · Updated 8 years ago
- Go http handler access logger ☆20 · Aug 3, 2016 · Updated 9 years ago
- OpenRTB implemented in PHP ☆18 · Aug 9, 2016 · Updated 9 years ago
- A Distributed HTTP Load Generator, based on rakyll/boom and Kubernetes (Work in Progress) ☆27 · May 5, 2023 · Updated 2 years ago
- Logging plugin for Supervisor ☆48 · Jul 14, 2021 · Updated 4 years ago