elastic / eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
☆23Updated last week
Related projects ⓘ
Alternatives and complementary repositories for eland
- An Elasticsearch client exposing DataFrame API☆285Updated last year
- A DBAPI and SQLAlchemy dialect for Elasticsearch☆108Updated 9 months ago
- Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.☆371Updated this week
- A tool for batch loading data files (json, parquet, csv, tsv) into ElasticSearch☆399Updated 2 years ago
- Python Client for OpenSearch☆359Updated this week
- 🆕 A machine learning plugin which supports an approximate k-NN search algorithm for Open Distro.☆277Updated 3 years ago
- ☆339Updated last year
- 🆕 Find the k-nearest neighbors (k-NN) for your vector data☆156Updated this week
- Set of Jupyter notebooks demonstrating Learning to Rank integrated with Solr and Elasticsearch☆165Updated 2 months ago
- Fast Avro for Python☆645Updated this week
- Elastic Common Schema☆1,012Updated last week
- Entity resolution for Elasticsearch.☆157Updated 3 months ago
- ml-commons provides a set of common machine learning algorithms, e.g. k-means, or linear regression, to help developers build ML related …☆98Updated this week
- python implementation of the parquet columnar file format.☆787Updated last week
- Joblib Apache Spark Backend☆242Updated 3 months ago
- Distributed SQL Engine in Python using Dask☆397Updated 2 months ago
- Improve your Elasticsearch, OpenSearch, Solr, Vectara, Algolia and Custom Search search quality.☆284Updated this week
- Apache Airflow - OpenApi Client for Python☆358Updated last month
- Use dask to fetch data from Elasticsearch in parallel by sending the request to each shard separatelly.☆20Updated 3 years ago
- Python package providing a simple interface to manipulate Elasticsearch queries and aggregations☆11Updated this week
- Read, write and update large scale pandas DataFrame with Elasticsearch☆35Updated last week
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scal…☆229Updated last week
- 🚎 Notebook sharing hub☆495Updated last year
- Metadata tracking and UI service for Metaflow!☆193Updated last week
- Kibana visualization like a Data Table, but with enhanced features like computed columns, filter bar, and “Split Cols” bucket☆309Updated this week
- Real-time stream processing for python☆1,244Updated 5 months ago
- S3 Filesystem☆890Updated last week
- A Python Client for Apache Superset REST API☆58Updated 11 months ago
- Performance evaluation of nearest neighbor search using Vespa, Elasticsearch and Open Distro for Elasticsearch K-NN☆116Updated 3 years ago
- OpenSearch Benchmark - a community driven, open source project to run performance tests for OpenSearch☆111Updated this week