tresata / spark-scaldingView external linksLinks
Use Cascading Taps and Scalding DSL with Spark
☆49Dec 28, 2016Updated 9 years ago
Alternatives and similar repositories for spark-scalding
Users that are interested in spark-scalding are comparing it to the libraries listed below
Sorting:
- Integration for Cascading and Apache Hive☆25Oct 31, 2017Updated 8 years ago
- Kafka consumer & producer for scalaz-stream☆12Dec 15, 2017Updated 8 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Apr 18, 2017Updated 8 years ago
- A command-line parser for Scala☆65Nov 15, 2019Updated 6 years ago
- spray based client for aws☆38May 4, 2016Updated 9 years ago
- A full feaured Java-based template engine for Play2☆57Jul 3, 2014Updated 11 years ago
- Cascading and Scalding wrapper for HBase with advanced read features☆54Feb 11, 2020Updated 6 years ago
- Use AlluxioBlockManager to intead TachyonBlockManager as spark's off_heap.☆14Nov 3, 2016Updated 9 years ago
- unix domain sockets that look just like tcp sockets☆11Jun 21, 2018Updated 7 years ago
- Sparse feature extraction with Spark☆30Jul 25, 2018Updated 7 years ago
- A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.☆13Oct 27, 2021Updated 4 years ago
- Cascading on Apache Flink®☆54Feb 5, 2024Updated 2 years ago
- A Scala Collection for Multiple Access Patterns☆12Oct 22, 2016Updated 9 years ago
- Akka streams extension☆13Apr 25, 2021Updated 4 years ago
- Gigahorse plugin for Github API v3☆12Jun 24, 2018Updated 7 years ago
- Zipkin Mesos Framework☆31Feb 24, 2016Updated 9 years ago
- Simple Job Workflow API for Tasks☆32Dec 5, 2025Updated 2 months ago
- Yahoo! Cloud Serving Benchmark☆20Jul 20, 2015Updated 10 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- ☆16Feb 20, 2016Updated 9 years ago
- Scala Data access for NoSQL databases☆47Jun 4, 2013Updated 12 years ago
- Type-Safe framework for defining, modifying, and querying SQL databases☆59Apr 18, 2024Updated last year
- Apache Spark jobs such as Principal Coordinate Analysis.☆75Jan 30, 2017Updated 9 years ago
- Library for deep embedding of DSLs based on Scala macros.☆75Jan 12, 2016Updated 10 years ago
- Macro-based type providers for Scala (examples)☆85Aug 31, 2015Updated 10 years ago
- A whole bunch of functions, filters, and other tools that make writing Cascading flows a joy☆54Mar 19, 2023Updated 2 years ago
- Monad transformers for exception handling☆17Aug 19, 2024Updated last year
- small Scala veneer over JGit☆21Sep 17, 2025Updated 4 months ago
- 极客班第一期学员的C++作业请提交到这里,按照自己的学生编号创建文件夹☆12Oct 26, 2015Updated 10 years ago
- Recipes and examples for Apache Spark☆13Jan 21, 2015Updated 11 years ago
- A Spark-based LexRank extractive summarizer for text documents☆19Dec 23, 2015Updated 10 years ago
- Example of an implementation of SQL as a DSL in scala☆53Nov 4, 2010Updated 15 years ago
- Rapid prototyping HTTP toolkit based on Netty. Supports container-style jars, multi-hosting, REST primitives.☆19Jul 14, 2013Updated 12 years ago
- Tiny publish subscribe library☆15Mar 27, 2017Updated 8 years ago
- Low level integration of Spark and Kafka☆130Mar 15, 2018Updated 7 years ago
- migrated to git.rossabaker.com☆16Sep 4, 2025Updated 5 months ago
- Cats instances for fastparse☆18May 6, 2018Updated 7 years ago
- ☆19Sep 8, 2017Updated 8 years ago
- Code for Springer Book: High Performance Distributed Computing: Case Studies with Hadoop, Scalding and Spark☆15Oct 6, 2017Updated 8 years ago