C4.5 is a commonly used decision tree algorithm for classification in data mining. The existing C4.5 implementation runs serially. We are implementing the algorithm on the Hadoop MapReduce framework so that it can run in parallel across multiple systems.
☆14 · May 6, 2014 · Updated 11 years ago
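The idea of parallelizing C4.5 attribute selection can be illustrated with a minimal sketch. This is not the repository's code: it simulates the map and reduce phases in a single Python process rather than on Hadoop, handles only discrete attributes (no continuous splits, missing values, or pruning), and all function names (`map_record`, `reduce_counts`, `gain_ratios`, `best_attribute`) are hypothetical. It assumes each record is a tuple whose last element is the class label.

```python
import math
from collections import Counter, defaultdict


def map_record(record):
    """Map step: emit ((attribute_index, attribute_value, class_label), 1)."""
    *attributes, label = record
    for index, value in enumerate(attributes):
        yield (index, value, label), 1


def reduce_counts(pairs):
    """Reduce step: sum the counts emitted for each key."""
    totals = Counter()
    for key, count in pairs:
        totals[key] += count
    return totals


def entropy(counts):
    """Shannon entropy (base 2) of a collection of non-negative counts."""
    total = sum(counts)
    return -sum(c / total * math.log2(c / total) for c in counts if c > 0)


def gain_ratios(totals):
    """Compute the C4.5 gain ratio of every attribute from the reduced counts."""
    # by_attr[index][value][label] = number of records
    by_attr = defaultdict(lambda: defaultdict(Counter))
    for (index, value, label), count in totals.items():
        by_attr[index][value][label] += count

    ratios = {}
    for index, values in by_attr.items():
        class_counts = Counter()
        for label_counts in values.values():
            class_counts.update(label_counts)
        n = sum(class_counts.values())

        # Size of each partition produced by splitting on this attribute.
        sizes = [sum(label_counts.values()) for label_counts in values.values()]
        conditional = sum(
            size / n * entropy(label_counts.values())
            for size, label_counts in zip(sizes, values.values())
        )
        gain = entropy(class_counts.values()) - conditional
        split_info = entropy(sizes)
        # An attribute with a single value has zero split information.
        ratios[index] = gain / split_info if split_info > 0 else 0.0
    return ratios


def best_attribute(records):
    """Run the simulated map and reduce phases, then pick the best attribute."""
    pairs = (pair for record in records for pair in map_record(record))
    ratios = gain_ratios(reduce_counts(pairs))
    return max(ratios, key=ratios.get), ratios
```

In a real Hadoop job the map and reduce functions would run on separate nodes over partitions of the data, and only the aggregated counts would need to reach the step that computes gain ratios. Calling `best_attribute` on a list of tuples returns the index of the attribute with the highest gain ratio together with the per-attribute ratios.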
Alternatives and similar repositories for C4.5-using-hadoop-map-reduce-framework
Users that are interested in C4.5-using-hadoop-map-reduce-framework are comparing it to the libraries listed below
- Rossmann Store Sales: https://www.kaggle.com/c/rossmann-store-sales · ☆10 · May 13, 2018 · Updated 7 years ago
- ☆11 · May 8, 2020 · Updated 5 years ago
- ☆12 · Apr 19, 2024 · Updated last year
- implementation of https://www.usenix.org/system/files/conference/nsdi14/nsdi14-paper-bhagwan.pdf · ☆14 · Nov 23, 2018 · Updated 7 years ago
- This project includes different demos based on Lightstreamer StockList Adapter · ☆21 · Aug 1, 2023 · Updated 2 years ago
- ☆16 · May 31, 2017 · Updated 8 years ago
- Case Recommender: A Flexible and Extensible Python Framework for Recommender Systems · ☆14 · Jul 2, 2018 · Updated 7 years ago
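- AIOps-related materials · ☆20 · Sep 7, 2018 · Updated 7 years ago
- Bayesian thinking · ☆15 · Jan 5, 2019 · Updated 7 years ago
- Large-scale duplicate content detection using the simhash algorithm · ☆14 · Apr 23, 2016 · Updated 9 years ago
- Bayesian network structure learning · ☆18 · May 22, 2022 · Updated 3 years ago
- Original project: [GCN_AAAI2019](https://github.com/yao8839836/text_gcn/); added: multi-label classification support · ☆18 · Dec 17, 2019 · Updated 6 years ago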
- some strategies for exposure bias in seq2seq · ☆18 · Sep 9, 2020 · Updated 5 years ago
- Demo on how to integrate Spring Data JPA, Apache Spark and GraphX with Java and Scala mixed codes · ☆19 · May 14, 2018 · Updated 7 years ago
- A simple, scalable, and highly efficient web crawler framework for Java. · ☆25 · Jan 4, 2018 · Updated 8 years ago
- N-BEATS: Neural basis expansion analysis for interpretable time series forecasting. · ☆23 · Jun 28, 2019 · Updated 6 years ago
- ☆20 · May 24, 2016 · Updated 9 years ago
- https://www.kaggle.com/c/home-credit-default-risk#description · ☆20 · Jul 30, 2018 · Updated 7 years ago
- A deep learning neural network for abstractive deep summarization · ☆20 · Dec 23, 2019 · Updated 6 years ago
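- Calling TensorFlow and Keras models from Java · ☆24 · May 2, 2018 · Updated 7 years ago
- Some popular algorithms (DBSCAN, KNN, FM, etc.) on Spark · ☆32 · May 29, 2018 · Updated 7 years ago
- ☆27 · Feb 12, 2019 · Updated 7 years ago
- Compilation and interpretation of the textbook Causal Inference: What If · ☆35 · May 16, 2020 · Updated 5 years ago
- Based on sklearn, enhances the Pipeline and FeatureUnion classes. FeatureUnion gains support for processing only part of the data; both classes gain the ability to record feature transformation behavior. · ☆29 · Jul 28, 2016 · Updated 9 years ago
- Baidu multi-label classification of test questions into 95 categories · ☆26 · Apr 8, 2020 · Updated 5 years ago
- JData Algorithm Competition · ☆31 · Aug 16, 2017 · Updated 8 years ago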
- A text classifier based on Decision Trees ID3, Naive Bayes and KNN algorithm in C++ and JAVA. · ☆39 · Nov 30, 2017 · Updated 8 years ago
- Classification of emotions based on age and pulse rate using Support Vector Machines · ☆30 · Aug 10, 2015 · Updated 10 years ago
- Some tools that I often find myself using in Kaggle challenges. · ☆32 · Jan 8, 2026 · Updated last month
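- 500 Questions on Deep Learning: a question-and-answer treatment of commonly encountered topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas, intended to help the author and interested readers. The book has 15 chapters and nearly 200,000 characters. Given the author's limited expertise, readers are kindly asked to point out any shortcomings. To be continued... For collaboration, contact sc… · ☆30 · Nov 8, 2018 · Updated 7 years ago
- JPMML-SparkML plugin for converting XGBoost4J-Spark models to PMML · ☆36 · Mar 25, 2020 · Updated 5 years ago
- Tencent Social Ads Algorithm Competition 2018 · ☆41 · Jul 9, 2018 · Updated 7 years ago
- A news recommendation evaluation framework · ☆44 · Oct 17, 2018 · Updated 7 years ago
- Kaggle challenge Bag of words meets bags of popcorn in Python 3 · ☆36 · Feb 14, 2018 · Updated 8 years ago
- Spark study notes · ☆45 · Mar 23, 2023 · Updated 2 years ago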
- Project work for the Udacity Data Analyst Nanodegree · ☆39 · Jun 29, 2017 · Updated 8 years ago
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark. · ☆48 · Jul 4, 2018 · Updated 7 years ago
- ☆54 · Aug 26, 2018 · Updated 7 years ago
- ☆47 · Jul 10, 2023 · Updated 2 years ago