HouJP / my-mllib
This project implements several machine learning algorithms on Spark, written in Scala, and also includes standalone implementations of those algorithms.
☆15 Updated 3 years ago
Alternatives and similar repositories for my-mllib:
Users who are interested in my-mllib are comparing it to the libraries listed below.
- Some popular algorithms (DBSCAN, kNN, FM, etc.) on Spark ☆32 Updated 6 years ago
- Adds an ftrl_fm Cython implementation ☆13 Updated 8 years ago
- Using GBDT+LR in a recommender system and comparing the AUC of LR, GBDT, and GBDT+LR. ☆24 Updated 7 years ago
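- Spark, NLP, new word discovery, natural language processing ☆23 Updated 7 years ago
- CIKM 2019 E-Commerce AI Challenge: efficient retrieval of user interests for ultra-large-scale recommendation ☆11 Updated 3 years ago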
- ☆12 Updated 8 years ago
- An implementation of GBDT+FM ☆24 Updated 8 years ago
- 7th in a competition organised by ICT ☆24 Updated 9 years ago
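- Integrates existing open-source distributed machine learning tools (mainly parameter-server-based logistic regression, xgboost, FFM, and FM) to build an industrial-grade click-through-rate prediction pipeline that can be used in production ☆26 Updated 7 years ago
- Mainly addresses three problems in CTR prediction engineering: feature selection, feature indexing (feature discretization), and per-feature AUC and logloss. ☆20 Updated 8 years ago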
- spark-simrank scala ☆17 Updated 8 years ago
- Recommender-In-Detail is a package which offers detailed implementations of state-of-the-art techniques and basic methods in recommendati… ☆19 Updated 5 years ago
- A Spark-based implementation of LambdaMART ☆11 Updated 10 years ago
- 2016 CCF: precise merchant marketing based on user trajectories ☆18 Updated 8 years ago
- Uses XGBoost and LR models for text classification; XGBoost serves as a feature transform for LR ☆44 Updated 7 years ago
- CTR prediction models based on Spark (LR, FM, XGBoost, XGBoostLR, XGBoostFM) ☆35 Updated 4 years ago
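- Ant Financial: precise user location competition ☆12 Updated 7 years ago
- Built on sklearn, this enhances the Pipeline and FeatureUnion classes. FeatureUnion gains support for processing only part of the data; both classes gain the ability to record feature transformation behavior. ☆29 Updated 8 years ago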
- convert DataFrame to libffm data format in parallel ☆30 Updated 6 years ago
- 2016CCF-sougou-code&PPT ☆55 Updated 8 years ago
- Contextual Recommendation Implementation for Research Purposes ☆19 Updated 9 months ago
- Computational advertising study notes ☆24 Updated 3 years ago
- A POC of Google's Wide & Deep Learning models deployed on Google Cloud ML Engine for Kaggle's Outbrain Click Competition ☆36 Updated 6 years ago
- Deep structured semantic model ☆32 Updated 8 years ago
- Deep Learning Pipelines for Apache Spark ☆58 Updated 7 years ago
- code exercise: dbscan(ballTree improve) | ctr(ftrl) | text classification(bayes..) | kmeans | general LR |.. ☆26 Updated 9 years ago
- 2018 JData, China Unicom: identifying risky users based on mobile network communication behavior: Baseline 0.77 ☆21 Updated 6 years ago
- ☆27 Updated 7 years ago
- A potential 22nd rank solution to Criteo Labs Display Advertising Challenge on Kaggle ☆25 Updated 7 years ago
- CCF: Sogou user profile mining for big data precision marketing ☆17 Updated 8 years ago