elastic / elandLinks
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
☆683Updated 3 weeks ago
Alternatives and similar repositories for eland
Users that are interested in eland are comparing it to the libraries listed below
Sorting:
- An Elasticsearch client exposing DataFrame API☆284Updated 2 years ago
- A tool for batch loading data files (json, parquet, csv, tsv) into ElasticSearch☆401Updated 3 years ago
- Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.☆383Updated this week
- Python Client for OpenSearch☆426Updated last week
- 🆕 A machine learning plugin which supports an approximate k-NN search algorithm for Open Distro.☆283Updated 4 years ago
- A DBAPI and SQLAlchemy dialect for Elasticsearch☆116Updated last year
- 🆕 Find the k-nearest neighbors (k-NN) for your vector data☆188Updated this week
- Cookiecutter API for creating Custom Skills for Azure Search using Python and Docker☆533Updated 2 years ago
- Improve your OpenSearch, Elasticsearch, Solr, Vectara, Algolia and Custom Search search quality.☆309Updated this week
- S3 Filesystem☆951Updated last week
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,236Updated 5 months ago
- Fuzzy string matching, grouping, and evaluation.☆771Updated last week
- Read, write and update large scale pandas DataFrame with Elasticsearch☆35Updated 7 months ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆503Updated 5 months ago
- Apache Airflow - OpenApi Client for Python☆406Updated last month
- Clean personally identifiable information from dirty dirty text.☆411Updated last year
- Fast Avro for Python☆676Updated last week
- a general utility for anonymizing data☆123Updated 3 weeks ago
- Joblib Apache Spark Backend☆249Updated 3 months ago
- ml-commons provides a set of common machine learning algorithms, e.g. k-means, or linear regression, to help developers build ML related …☆123Updated this week
- UnionML: the easiest way to build and deploy machine learning microservices☆335Updated last year
- Generate and Visualize Data Lineage from query history☆326Updated last year
- A tool for building feature stores.☆308Updated 2 weeks ago
- A plugin for Apache Airflow that allows you to edit DAGs in browser☆439Updated last month
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,015Updated last year
- Python package to accelerate the sparse matrix multiplication and top-n similarity selection☆408Updated last week
- Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful …☆143Updated 11 months ago
- Super Fast String Matching in Python☆370Updated 4 months ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆471Updated 6 months ago
- Tool to automate data quality checks on data pipelines☆255Updated 2 years ago