C4.5 is a commonly used in decision tree algorithm in data mining for classification. The existing C4.5 algorithm implementation is running in serial way. We are implementing this algorithm using Hadoop MapReduce framework which can run parallel in multiple system.
☆14May 6, 2014Updated 12 years ago
Alternatives and similar repositories for C4.5-using-hadoop-map-reduce-framework
Users that are interested in C4.5-using-hadoop-map-reduce-framework are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- simhash算法实现海量内容查重☆14Apr 23, 2016Updated 10 years ago
- ☆13Mar 2, 2016Updated 10 years ago
- Rossmann Store Sales: https://www.kaggle.com/c/rossmann-store-sales☆10May 13, 2018Updated 8 years ago
- ☆11May 8, 2020Updated 6 years ago
- 在Docker容器中运行Hadoop大数据组件和机器学习平台☆11Apr 3, 2019Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Oct 23, 2017Updated 8 years ago
- Movie Recommendation System using Apache Spark and Python implementing User based Collaborative Filtering and Item Based Collaborative Fi…☆12Mar 13, 2016Updated 10 years ago
- doddle-model code examples☆19Sep 23, 2019Updated 6 years ago
- ☆16May 31, 2017Updated 9 years ago
- Implementation of the paper "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting", https://arxi…☆19Jul 20, 2021Updated 4 years ago
- some strategies for exposure bias in seq2seq☆18Sep 9, 2020Updated 5 years ago
- Case Recommender: A Flexible and Extensible Python Framework for Recommender Systems☆14Jul 2, 2018Updated 7 years ago
- AIOps相关资料☆20Sep 7, 2018Updated 7 years ago
- 贝叶斯思维☆15Jan 5, 2019Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A text classifier based on Decision Trees ID3, Naive Bayes and KNN algorithm in C++ and JAVA.☆40Nov 30, 2017Updated 8 years ago
- code for ResSys'18 paper: "Exploring Recommendations Under User-Controlled Data Filtering"☆23Oct 16, 2018Updated 7 years ago
- MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting☆20Jul 11, 2023Updated 2 years ago
- ☆12May 28, 2024Updated 2 years ago
- ☆25Jun 11, 2016Updated 10 years ago
- Pipeline of Data Extraction, Preprocessing, Representation, and Training for MIMIC-III☆18Nov 16, 2020Updated 5 years ago
- ☆24Jun 10, 2017Updated 9 years ago
- Bayesian network structure learning☆18May 22, 2022Updated 4 years ago
- implementation of https://www.usenix.org/system/files/conference/nsdi14/nsdi14-paper-bhagwan.pdf☆14Nov 23, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Demo on how to integrate Spring Data JPA, Apache Spark and GraphX with Java and Scala mixed codes☆19May 14, 2018Updated 8 years ago
- 原始项目:[GCN_AAAI2019](https://github.com/yao8839836/text_gcn/) add:多标签分类支持☆18Dec 17, 2019Updated 6 years ago
- Notes from Georgia Tech's CS7641 and Tom Mitchell's "Machine Learning."☆27Feb 20, 2014Updated 12 years ago
- use deepar to predict water supply network pressure☆20Feb 2, 2021Updated 5 years ago
- https://www.kaggle.com/c/home-credit-default-risk#description☆20Jul 30, 2018Updated 7 years ago
- N-BEATS: Neural basis expansion analysis for interpretable time series forecasting.☆23Jun 28, 2019Updated 6 years ago
- 融合专家知识的贝叶斯网络结构学习☆19Jan 4, 2022Updated 4 years ago
- Lenovo Y50 Subwoofer Enabler for Linux☆34Jan 26, 2021Updated 5 years ago
- Use icons your console log messages. It's console.icon()☆48Apr 12, 2017Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆34Jul 9, 2024Updated last year
- 腾讯社交广告算法大赛2018☆41Jul 9, 2018Updated 7 years ago
- Some popular algorithms(dbscan,knn,fm etc.) on spark☆32May 29, 2018Updated 8 years ago
- a rude awakening☆50May 5, 2021Updated 5 years ago
- 基于sklearn,强化Pipeline和FeatureUnion两个类。对FeatureUnion类,使其支持部分数据处理;对两者,增加特征转换行为记录的功能。☆29Jul 28, 2016Updated 9 years ago
- ☆27Feb 12, 2019Updated 7 years ago
- Baidu 95categories of multi-label test question classification☆26Apr 8, 2020Updated 6 years ago