baidu / bigflow
Baidu Bigflow is an interface that allows for writing distributed computing programs and provides lots of simple, flexible, powerful APIs. Using Bigflow, you can easily handle data of any scale. Bigflow processes 4P+ data inside Baidu and runs about 10k jobs every day.
☆1,136Updated 2 years ago
Alternatives and similar repositories for bigflow:
Users that are interested in bigflow are comparing it to the libraries listed below
- A Toolkit for Industrial Topic Modeling☆2,639Updated 3 years ago
- Blade is a powerful build system from Tencent, supports many mainstream programming languages, such as C/C++, java, scala, python, protob…☆2,060Updated 3 weeks ago
- 腾讯高性能分布式图计算框架Plato☆1,902Updated 3 years ago
- An Internet-Scale Database.☆1,892Updated 7 months ago
- Deep Learning Chinese Word Segment☆2,082Updated 6 years ago
- Lightweight and Scalable framework that combines mainstream algorithms of Click-Through-Rate prediction based computational DAG, philosop…☆673Updated 5 years ago
- A light-weight RPC implement of google protobuf RPC framework.☆2,138Updated last year
- Galaxy is a cluster management system.☆326Updated 7 years ago
- AI on Hadoop☆1,730Updated 6 months ago
- BaikalDB, A Distributed HTAP Database.☆1,197Updated last month
- A full-text search engine supporting massive users, real-time updating, fast fuzzy matching and flexible table splitting.☆486Updated last year
- A lightweight parameter server interface☆1,543Updated 2 years ago
- An industrial deep learning framework for high-dimension sparse data☆4,267Updated 3 months ago
- ☆321Updated this week
- ODPS Python SDK and data analysis framework☆433Updated this week
- CUP, common useful python-lib. (Currently, Most popular python lib in baidu). Python 开发底层库, 涵盖util、service(threadpool/generator/executo…☆943Updated last month
- Apache Pegasus - A horizontally scalable, strongly consistent and high-performance key-value store☆1,989Updated this week
- PaxosStore has been deployed in WeChat production for more than two years, providing storage services for the core businesses of WeChat b…☆1,690Updated 4 years ago
- Baidu Elasticsearch☆433Updated 6 years ago
- A distributed graph deep learning framework.☆2,901Updated last year
- PArallel Distributed Deep LEarning (PaddlePaddle核心框架,高性能单机、分布式训练和跨平台部署)☆29Updated 2 months ago
- A distributed in-memory NOSQL system based on TARS framework, support LRU algorithm and data persists on back-end database. Users can ea…☆749Updated 11 months ago
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,838Updated 7 months ago
- CTR prediction model based on spark(LR, GBDT, DNN)☆908Updated 4 years ago
- ☆1,629Updated 2 months ago
- A high-availability, high-throughput and highly reliable distributed queue based on the Paxos algorithm.☆1,904Updated last year
- An open-source columnar data format designed for fast & realtime analytic with big data.☆454Updated 2 years ago
- Distributed training framework with parameter server☆338Updated 8 years ago
- embedx 是基于 c++ 开发的、完全自研的分布式 embedding 训练和推理框架。它目前支持 图模型、深度排序、召回模型和图与排序、图与召回的联合训练模型等☆300Updated 7 months ago
- 中文文档simhash值计算☆1,116Updated last month