Spark stream from Kafka (JSON) to S3 (Parquet)
☆15Nov 8, 2018Updated 7 years ago
Alternatives and similar repositories for jaquet
Users who are interested in jaquet are comparing it to the libraries listed below.
- Auto-archived due to inactivity. Simple JVM Profiler Using StatsD and Other Metrics Backends☆15Oct 3, 2023Updated 2 years ago
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
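- Spark self-study handbook covering Spark Core, Spark SQL, Spark Streaming, spark-kafka, and delta-lake, plus basic Scala exercises and source-code analyses, summaries, and translations on topics such as master and shuffle.☆18Jul 19, 2023Updated 2 years ago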
- ☆13Jul 8, 2023Updated 2 years ago
- A testing DSL for kafka-streams☆14Aug 9, 2017Updated 8 years ago
- NuCypher for Kafka. Start building from this module (it fetches the appropriate branch from Kafka repository)☆17Oct 13, 2017Updated 8 years ago
- A type-driven approach to string interpolation, aiming at consistent, secure, and only-human-readable logs and console outputs!☆14May 5, 2024Updated 2 years ago
- An Apache Spark app for moving data between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 10 years ago
- Fork to add support for assumed roles☆15Sep 15, 2022Updated 3 years ago
- My HackerRank Solutions : https://www.hackerrank.com/RohanKhude☆12Jul 13, 2016Updated 9 years ago
- AWS SSM in Action, the next generation of SSH☆23Mar 14, 2018Updated 8 years ago
- A SimpleDB Administration web-interface☆28Oct 30, 2023Updated 2 years ago
- NICTA Named Entity Recogniser is a rule based Named Entity Recogniser which extracts named entities from text such as Organisation, Locat…☆16Apr 15, 2023Updated 3 years ago
- Manage your AWS infrastructure and ECS tasks with two separate ansible playbooks☆24Mar 9, 2018Updated 8 years ago
- Workshop for Spark and Databricks☆55Dec 6, 2019Updated 6 years ago
- Algorithms and Data Structures implemented in Java☆12Jul 28, 2019Updated 6 years ago
- PoC using scala that defines single-message protobuf schema per Kafka topic☆23Feb 7, 2020Updated 6 years ago
- Demos for learning Apache Flink☆10Jun 21, 2017Updated 8 years ago
- How to use Parquet in Flink☆32May 2, 2017Updated 9 years ago
- Some machine learning practice exercises☆11Jun 29, 2022Updated 3 years ago
- This project describes how to write a full ETL data pipeline using Spark.☆15Oct 15, 2022Updated 3 years ago
- Create a data mart using Azure Data Factory as ELT / ETL, Azure Synapse as database and Power BI as visualization tool.☆19Apr 20, 2022Updated 4 years ago
- Queries the Spark REST API for applications, jobs, stages, executors, RDDs, streaming, environment, and other information to provide monitoring and alerting services☆11Nov 22, 2018Updated 7 years ago
- Template to deploy Synapse Analytics using best practices to deliver a proof of concept.☆21Mar 3, 2023Updated 3 years ago
- Examples for Apache Oozie book☆18May 30, 2016Updated 9 years ago
- IoT Trucking App with Flink (with Table API & SQL)☆14Jul 4, 2018Updated 7 years ago
- Competitive Programming Solutions, mostly in Java. Regularly updated for space and time efficiency☆19Mar 5, 2023Updated 3 years ago
- Like jq, but with json pointers☆16Nov 30, 2025Updated 5 months ago
- Create an Amazon AWS Mesos cluster using Terraform☆12Feb 15, 2017Updated 9 years ago
- A giter8 template for Spark SBT projects☆72Mar 20, 2021Updated 5 years ago
- Converts JSON or SQL into Flink or Spark streaming/batch jobs☆12Jun 21, 2022Updated 3 years ago
- I'll munch some data here☆12Jun 18, 2021Updated 4 years ago
- CLI and Go Clients to manage Kafka components (Kafka Connect & SchemaRegistry)☆29May 17, 2017Updated 9 years ago
- sample oozie workflows☆17Jun 13, 2017Updated 8 years ago
- An AI spreadsheet-processing tool built on FastAPI + LangChain + OpenAI API + Vue, for intelligently processing and analyzing tabular data.☆20Jul 14, 2025Updated 10 months ago
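- A causal-analysis Multi-Agent system driven by the LangGraph protocol; it combines tools such as MCP and RAG to assist the causal analysis and gives the user a complete causal-analysis report and causal graph☆37May 18, 2026Updated last week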
- DuckDB Explain Visualizer (DEV) based on pev2☆39Aug 22, 2025Updated 9 months ago
- ☆14Nov 3, 2016Updated 9 years ago
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Sep 16, 2019Updated 6 years ago