Google Cloud Dataflow pipelines such as Identity-By-State as well as useful utility classes.
☆37Aug 9, 2023Updated 2 years ago
Alternatives and similar repositories for dataflow-java
Users that are interested in dataflow-java are comparing it to the libraries listed below
Sorting:
- Paper elements by Google translated to React☆13Nov 20, 2014Updated 11 years ago
- 迁移工具,目标是Oracle,MySQL,SqlServer到PostgreSQL的单项迁移,PostgreSQL和大数据平台Hive,Hbase,Impala等的双向迁移。☆10Dec 3, 2014Updated 11 years ago
- Distributed SQL base Realtime Streaming Computation Framework On Apache Storm, Spark☆12Mar 14, 2016Updated 9 years ago
- ☆84Jan 26, 2026Updated last month
- CSI driver for EdgeFS☆11Oct 23, 2019Updated 6 years ago
- 基于ActiveMQ的数据交换中间件☆14Aug 17, 2014Updated 11 years ago
- Java Bindings (JNI) for bwa☆20Dec 15, 2016Updated 9 years ago
- an example of integrating Spark Streaming with Google Pub/Sub and Google Datastore☆17Mar 22, 2017Updated 8 years ago
- ETL for moving Ethereum data to Neo4j database☆20Mar 30, 2020Updated 5 years ago
- Apache Hudi Demo☆22Apr 24, 2025Updated 10 months ago
- 个性化推荐算法的通用处理框架,基于Mahout和Lucene☆18May 25, 2015Updated 10 years ago
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.☆851Nov 25, 2020Updated 5 years ago
- Convert JSON from document-oriented DB to neo4j graph☆21Dec 27, 2021Updated 4 years ago
- The admin user interface for CrateDB.☆28Feb 14, 2026Updated last week
- Implementation of Google Cloud Pub/Sub backed by Apache Kafka.☆59Jan 9, 2023Updated 3 years ago
- 文本去重算法,研究自推荐系统中新闻的去重,采用了雅虎的Near-duplicates and shingling算法,服务端用c实现,客户端用java实现,利用thrift框架进行通信,为了提高扩展性,去重可以在服务端实现,服务器也提供了计算的接口,方便客户端自己扩展☆24Feb 25, 2014Updated 12 years ago
- Apache Fluo Muchos☆26Dec 6, 2024Updated last year
- 解析Mysql binlog日志并发至Kafka☆23Nov 25, 2016Updated 9 years ago
- Processing Logs at Scale using Cloud Dataflow☆62Mar 18, 2019Updated 6 years ago
- Vert.x Metrics☆34Feb 16, 2026Updated last week
- Java Fast NIO Socks 4/5 Proxy based on Netty.io☆37Jan 17, 2012Updated 14 years ago
- ☆40Aug 3, 2015Updated 10 years ago
- Apache NLPCraft - API to convert natural language into actions.☆83May 22, 2025Updated 9 months ago
- ☆31Mar 21, 2016Updated 9 years ago
- 本项目转移到https://github.com/cocolian/cocolian-nlp☆34Jun 8, 2014Updated 11 years ago
- RDMA for HDFS☆27Oct 29, 2018Updated 7 years ago
- Modern Style, a framework for optimizing SASS on web applications and sites.☆11Jan 14, 2015Updated 11 years ago
- Mirror of Apache Whirr☆95Apr 28, 2017Updated 8 years ago
- 各种安全相关思维导图整理收集☆11Sep 7, 2015Updated 10 years ago
- Library for interacting with Eventbrite (old api, yanked from Rubygems, for V3 - see: https://github.com/envoy/eventbrite)☆22Aug 18, 2012Updated 13 years ago
- hadoop中Map/Reduce使用示例,输入(DBInputFormat),输出(DBOutputFormat)为MySql数据库表、日志分析Grep、单词排序Sort...对HBase的基本操作,增、删、查、改,使用Map/Reduce批量导入数据到HBase表中..…☆14Apr 6, 2013Updated 12 years ago
- Solução Opensource de backup corporativo desenvolvida para empresas e governos☆11Sep 21, 2011Updated 14 years ago
- json或SQL语言转为flink或者spark流/批任务☆12Jun 21, 2022Updated 3 years ago
- 这是居于 derby 源代码,通过删减的方式,从里面抽取出sql解析功能。并在此基础上开发出跨库连接查询器。通过该工具可以将连接查询分割成多个单表查询,再将单表结果集进行连接,即将数据库的连接功能上移到工具执行。详情可以查看wiki:readme☆10Feb 14, 2017Updated 9 years ago
- ☆11Sep 1, 2022Updated 3 years ago
- [Plant Phenomics] Eff-3DPSeg: 3D organ-level plant shoot segmentation using annotation-efficient deep learning☆10Jul 10, 2023Updated 2 years ago
- Servermon is a Django project with the aim of facilitating server monitoring and management through Puppet☆24Dec 11, 2018Updated 7 years ago
- ☆11Jan 15, 2025Updated last year
- This Pinyin Analysis plugin is used to do conversion between Chinese characters and Pinyin.☆10Mar 28, 2019Updated 6 years ago