Benchmark of different ML algorithms on Criteo 1TB dataset
☆151May 10, 2017Updated 8 years ago
Alternatives and similar repositories for criteo-1tb-benchmark
Users that are interested in criteo-1tb-benchmark are comparing it to the libraries listed below
Sorting:
- High-performance key-value store☆12Dec 31, 2018Updated 7 years ago
- Advanced GBM Workshop - Budapest, Nov 2019☆13Nov 18, 2019Updated 6 years ago
- This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for under…☆10Mar 24, 2023Updated 2 years ago
- ☆12Jan 24, 2017Updated 9 years ago
- A new version of phraug, which is a set of simple Python scripts for pre-processing large files☆207Jul 12, 2018Updated 7 years ago
- Drop-in wrapper for Vowpal Wabbit that adds hyper-parameter tuning, more performance metrics, text preprocessing, reading from csv/tsv, f…☆21Mar 23, 2018Updated 7 years ago
- Java implementation of the Microsoft's AdPredictor algorithm☆17Mar 19, 2018Updated 7 years ago
- Field-aware Factorization Machines on CUDA☆30Jan 15, 2026Updated last month
- This demo shows how to learn a neural network on top of decision trees☆32May 15, 2017Updated 8 years ago
- Java library and command-line application for converting XGBoost models to PMML☆131Feb 4, 2026Updated last month
- Fast high-dimensional exact KNN search.☆18Mar 1, 2017Updated 9 years ago
- LightCTR is a tensorflow 2.0 based, extensible toolbox for building CTR/CVR predicting models.☆103Oct 30, 2023Updated 2 years ago
- Kaggle Criteo https://www.kaggle.com/c/criteo-display-ad-challenge☆97Jun 27, 2014Updated 11 years ago
- Simple and easy-to-understand ml algorithm implementations☆19Jun 17, 2017Updated 8 years ago
- GBM multicore scaling: h2o, xgboost and lightgbm on multicore and multi-socket systems☆20May 13, 2018Updated 7 years ago
- The winning solution to the Ad Placement Challenge (NIPS'17 Causal Inference and Machine Learning Workshop)☆38Dec 10, 2017Updated 8 years ago
- Benchmarks of artificial neural network library for Spark MLlib☆11Dec 3, 2015Updated 10 years ago
- Ytk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing java library which includes gat…☆111Jun 14, 2017Updated 8 years ago
- implementation of factorization machine, support classification.☆20Jun 7, 2018Updated 7 years ago
- R interface to Vowpal Wabbit☆23Jul 1, 2019Updated 6 years ago
- An attempt of training DNN models to predict ad click-through rate, implemented with Theano.☆408Jun 12, 2017Updated 8 years ago
- Official repository of QuickRank: A C++ suite of Learning to Rank algorithms.☆131Apr 24, 2019Updated 6 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Feb 3, 2018Updated 8 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Dec 17, 2014Updated 11 years ago
- Nearest Neighbor Search in High Dimensional Spaces☆13Nov 18, 2015Updated 10 years ago
- Fast and memory-efficient svmlight / libsvm file loader for Python.☆117Aug 25, 2019Updated 6 years ago
- Fast Python Vowpal Wabbit wrapper☆13Mar 31, 2021Updated 4 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆12Sep 12, 2016Updated 9 years ago
- ☆10Jun 14, 2014Updated 11 years ago
- Non Metric Space ( Approximate ) Library in R☆12Feb 2, 2023Updated 3 years ago
- Data and code related to the paper "Probabilistic matrix factorization for automated machine learning", NIPS, 2018.☆40Nov 26, 2021Updated 4 years ago
- Scalable R for Machine Learning☆43Sep 11, 2018Updated 7 years ago
- Penalized least squares estimation using the Orthogonalizing EM (OEM) algorithm☆27Aug 8, 2024Updated last year
- Масштабируемое машинное обучение и анализ больших данных с Apache Spark☆21Mar 11, 2018Updated 7 years ago
- A python library to generate highly realistic typos (fuzz-testing)☆13Mar 16, 2025Updated 11 months ago
- Web Analytics for Hackers☆14Jun 17, 2016Updated 9 years ago
- Java port of c++ version of facebook fasttext☆13Nov 14, 2017Updated 8 years ago
- Predict people interest in renting specific NYC apartments. The challenge combines structured data, geolocalization, time data, free text…☆18Nov 4, 2017Updated 8 years ago
- Extract plain or structured text from HTML content in R☆13Mar 1, 2019Updated 7 years ago