☆286Jan 5, 2024Updated 2 years ago
Alternatives and similar repositories for document_classfication
Users that are interested in document_classfication are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。☆1,755Apr 18, 2026Updated 2 weeks ago
- ✏️[计算机基础+java基础+大数据基础及进阶+面试指南] 一份涵盖计算机基础,java,大数据,面试宝典,大部分核心知识的项目,学习,面试,共同进步!☆86Jul 6, 2023Updated 2 years ago
- presto、trino资料分享,开发文档、源码阅读、二次开发。☆65Jan 19, 2025Updated last year
- 专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...☆10,457Aug 7, 2023Updated 2 years ago
- apache flink learning☆11Jun 21, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 智数通提供了元数据管理、数据标准管理、数据质量管理、主数据管理、数据集市管理、可视化图表看板、流程管理等微服务,是为数字化建设而生的企业级一站式数据治理平台。☆14May 30, 2022Updated 3 years ago
- 【2026最新版】 大数据 数据分析 电商系统 实时数仓 离线数仓 数据湖 建设方案及实战代码,涉及组件 #flink #paimon #doris #seatunnel #dolphinscheduler #datart #dinky #hudi #iceberg。☆1,113Apr 26, 2026Updated last week
- 🔥🔥 AllData可定义数据中台,以数据平台为底座,以数据中台为桥梁,以机器学习平台为工厂,以大模型应用为上游产品,提供全链路数字化解决方案。产品正式演示体验、社群咨询、商务采购:https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo☆3,020Apr 27, 2026Updated last week
- The next generation of cloud-native big data management expert , Aims to help users rapidly build stable, efficient, and scalable cloud-n…☆1,314Jul 22, 2025Updated 9 months ago
- Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.☆3,728Apr 20, 2026Updated 2 weeks ago
- rust_edu☆11Jul 17, 2022Updated 3 years ago
- Flink CDC is a streaming data integration tool☆6,415Updated this week
- Spark源码阅读(基于2.4.4)☆32Mar 22, 2020Updated 6 years ago
- 大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料☆3,175Jan 20, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆47May 5, 2021Updated 5 years ago
- SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offlin…☆827Jan 22, 2026Updated 3 months ago
- Make stream processing easier! Easy-to-use streaming application development framework and operation platform.☆4,308Mar 26, 2026Updated last month
- Tutorials on how to Query Data☆13Jan 7, 2023Updated 3 years ago
- 大数据组件学习代码☆65May 6, 2024Updated 2 years ago
- CloudEon uses Kubernetes to install and deploy open-source big data components, enabling the containerized operation of an open-source bi…☆492Oct 31, 2025Updated 6 months ago
- 该项目整合了多款优秀的开源产品,构建了一个功能全面的数据开发平台。平台提供了强大的数据集成、数据开发、数据查询、数据服务、数据质量管理、工作流调度和元数据管理功能。#dinky #dolphinscheduler #datavines #flinkcdc #openmeta…☆637Aug 5, 2025Updated 9 months ago
- freeswitch cdr☆15Jul 12, 2025Updated 9 months ago
- Data Dialogue enables natural language querying of databases by integrating LLMs with SQL databases.☆14May 3, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Dolphinscheduler best practices 海豚调度最佳实践☆19Sep 10, 2024Updated last year
- 基于Netty的小型RPC框架☆15Dec 2, 2018Updated 7 years ago
- flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Ta…☆15,052Apr 14, 2026Updated 3 weeks ago
- 基于Spark的新闻推荐系统☆11Dec 7, 2024Updated last year
- SeaTunnel is a multimodal, high-performance, distributed, massive data integration tool.☆9,302Updated this week
- springboot demo combined with scala and java☆11Dec 7, 2017Updated 8 years ago
- 数据建设与大数据技术知识体系,包含hadoop、hive、spark、flink主流框架和系列框架,数据中台、数据湖、数据治理、数仓建设、数据化转型等☆451Aug 8, 2025Updated 8 months ago
- LarkMidTable 是一站式开源的数据中台,实现中台的 基础建设,数据治理,数据开发,监控告警,数据服务,数据的可视化,实现高效赋能数据前台并提供数据服务的产品。☆2,039Aug 20, 2023Updated 2 years ago
- 利用Druid SQL Parser解析HiveSQL日志,自动构建字段级别的血缘关系及主外键的自动抽取☆44Feb 6, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Know your data better!Datavines is Next-gen Data Observability Platform, support metadata manage and data quality.☆737Apr 18, 2026Updated 2 weeks ago
- Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI☆1,298Apr 21, 2026Updated 2 weeks ago
- sample code of api gateway☆22Mar 19, 2021Updated 5 years ago
- [大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结☆1,656Aug 30, 2021Updated 4 years ago
- 深圳地铁大数据客流分析系统🚇🚄🌟☆2,454May 16, 2024Updated last year
- 基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法☆2,055Feb 21, 2024Updated 2 years ago
- Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.☆1,127Apr 23, 2026Updated last week