metarank / ranklens
Dataset for training ML ranking models
☆18 · Updated last year
Related projects
Alternatives and complementary repositories for ranklens
- Text similarity based on Word2Vec vectors. ☆10 · Updated 7 years ago
- phData Pulse application log aggregation and monitoring ☆13 · Updated 4 years ago
- Friendly ML feature store ☆45 · Updated 2 years ago
- Parquet Command-line Tools ☆18 · Updated 8 years ago
- This repository contains recipes for Apache Pinot. ☆24 · Updated last week
- Spark Parameter Optimization and Tuning ☆31 · Updated 6 years ago
- Neural Solr = Solr 9 + Mighty Inference + Node ☆16 · Updated 2 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink ☆14 · Updated 9 years ago
- Added functionality to the cml python package ☆14 · Updated last month
- Kubeflow example of machine learning/model serving ☆35 · Updated 4 years ago
- Curated list of machine learning and deep learning frameworks and resources for JVM ☆20 · Updated 3 years ago
- ☕⛵ WIP PySpark dependency management ☆22 · Updated 6 years ago
- CuVS integration for Lucene ☆29 · Updated 5 months ago
- Projects developed by Domino's R&D team ☆76 · Updated 2 years ago
- Python bindings for Matroid API ☆16 · Updated last month
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients ☆36 · Updated 3 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to… ☆26 · Updated this week
- Temporal_Graph_library ☆25 · Updated 5 years ago
- real-time data + ML pipeline ☆54 · Updated this week
- Peel is a framework that helps you to define, execute, analyze, and share experiments for distributed systems and algorithms. ☆27 · Updated 2 years ago
- Functional Airflow DAG definitions. ☆38 · Updated 7 years ago
- Demonstration code for MLeap, both Jupyter notebooks and projects ☆24 · Updated 5 years ago
- InsightEdge Core ☆20 · Updated 7 months ago
- Connect DBVisualizer to Hortonworks HiveServer2 ☆9 · Updated 9 years ago
- Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemeral or serverless). Please see the documen… ☆17 · Updated last year
- Data Science with Apache Spark and Spark Notebook ☆30 · Updated 7 years ago