baidu / bigflowLinks
Baidu Bigflow is an interface that allows for writing distributed computing programs and provides lots of simple, flexible, powerful APIs. Using Bigflow, you can easily handle data of any scale. Bigflow processes 4P+ data inside Baidu and runs about 10k jobs every day.
☆1,136Updated 3 weeks ago
Alternatives and similar repositories for bigflow
Users that are interested in bigflow are comparing it to the libraries listed below
Sorting:
- Lightweight and Scalable framework that combines mainstream algorithms of Click-Through-Rate prediction based computational DAG, philosop…☆673Updated 6 years ago
- An industrial deep learning framework for high-dimension sparse data☆4,306Updated last year
- An Internet-Scale Database.☆1,907Updated last year
- 腾讯高性能分布式图计算框架Plato☆1,913Updated 4 years ago
- AI on Hadoop☆1,733Updated 6 months ago
- Blade is a powerful build system from Tencent, supports many mainstream programming languages, such as C/C++, java, scala, python, protob…☆2,097Updated last week
- A lightweight parameter server interface☆1,557Updated 2 years ago
- Galaxy is a cluster management system.☆327Updated 8 years ago
- a TensorFlow-based distributed training framework optimized for large-scale sparse data.☆332Updated last week
- A Toolkit for Industrial Topic Modeling☆2,646Updated 4 years ago
- ODPS Python SDK and data analysis framework☆447Updated 3 weeks ago
- BaikalDB, A Distributed HTAP Database.☆1,227Updated 2 months ago
- A high-availability, high-throughput and highly reliable distributed queue based on the Paxos algorithm.☆1,905Updated 2 years ago
- CTR prediction model based on spark(LR, GBDT, DNN)☆921Updated 5 years ago
- A full-text search engine supporting massive users, real-time updating, fast fuzzy matching and flexible table splitting.☆496Updated 2 years ago
- A distributed graph deep learning framework.☆2,902Updated 2 years ago
- Deep Learning Chinese Word Segment☆2,077Updated 7 years ago
- spark ml 算法原理剖析以及具体的源码实现分析☆1,960Updated 6 years ago
- PArallel Distributed Deep LEarning (PaddlePaddle核心框架,高性能单机、分布式训练和跨平台部署)☆34Updated last year
- A light-weight RPC implement of google protobuf RPC framework.☆2,148Updated 2 years ago
- Multi-thread implementation of Factorization Machines with FTRL for binary-class classification problem.☆907Updated 4 years ago
- A Flexible and Powerful Parameter Server for large-scale machine learning☆6,785Updated 2 months ago
- Distributed training framework with parameter server☆338Updated 9 years ago
- Apache Pegasus - A horizontally scalable, strongly consistent and high-performance key-value store☆2,040Updated last week
- A light weight, super fast, large scale machine learning library on spark .☆680Updated 7 years ago
- CUP, common useful python-lib. (Currently, Most popular python lib in baidu). Python 开发底层库, 涵盖util、service(threadpool/generator/executo…☆954Updated last year
- Ytk-learn is a distributed machine learning library which implements most of popular machine learning algorithms(GBDT, GBRT, Mixture Logi…☆351Updated 3 years ago
- A stand alone industrial serving system for angel.☆65Updated 3 years ago
- Common library☆133Updated 8 years ago
- Distributed LR、 FM model on Parameter Server. FTRL and SGD Optimization Algorithm.☆224Updated 7 years ago